WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 5 : 

C07H 21704, C12N 15/00, 1/20, C07K 
14/435, 16/00, C12P 21/02, G01N 33/566, 
33/53, C12Q 1/68 



A2 



(11) International Publication Number: WO 99/54343 

(43) International Publication Date: 28 October 1999 (28.10.99) 



(21) International Application Number: PCT/US99/08896 

(22) International Filing Date: 23 April 1999 (23.04.99) 



(30) Priority Data: 

09/065,661 



23 April 1998 (23.04.98) 



US 



(71) Applicant: MILLENNIUM BIOTHERAPEUTICS, INC. 

[US/US]; 620 Memorial' Drive, Cambridge, MA 02142 
(US). 

(72) Inventor: HOLTZMAN, Douglas; 821 Centre Sheet #6, 

Jamaica Plain, MA 02130 (US). 

(74) Agent: MEIKLEJOHN, Anita, L.; Fish & Richardson P.C., 225 
Franklin Street, Boston, MA 021 10-2804 (US). 



(81) Designated States: AU, CA, JP, European patent (AT, BE, 
CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, 
NL, PT, SE). 



Published 

With declaration under Article 17(2)(a); without abstract; 
title not checked by the International Searching Authority. 



(54) Title: NOVEL MOLECULES OF THE T139-RELATED PROTEIN FAMILY AND USES THEREOF 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


n 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


Ul 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Artrbaijaa 


GB 


United Kingdom 


MC 


Monaco 


TO 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


Gil 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


brae) 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


UZ 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


ZVY 


Zimbabwe 


CI 


COte d'tvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






cu 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






cz 


Czech Republic 


IX 


Saint Lucia 


RU 


Russian Federation 






D£ 


Germany 


U 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


56 


Sweden 






BE 


Estonia 


LR 


Liberia 


SG 


Singapore 







WO 99/54343 



PCT/US99/08896 



NOVEL MOLECULES OF THE T 1 3 9 - RELATED PROTEIN 
FAMILY AND USES THEREOF 
5 Background of the Invention 

The invention relates to a novel secreted protein 
and the gene encoding it. 

Many membrane-associated and secreted proteins, 
for example, cytokines, play a vital role in the 
10 regulation of cell growth, cell differentiation, and a 
variety of specific cellular responses. A number of 
medically useful proteins, including erythropoietin, 
granulocyte -macrophage colony stimulating factor, human 
growth hormone, and various inter leukins, are secreted 
15 proteins. Thus, an important goal in the design and 
development of new therapies is the identification and 
characterization of membrane-associated and secreted 
proteins and the genes which encode them. 

Many membrane -associated proteins are receptors 
20 which bind a ligand and transduce an intracellular 

signal, leading to a variety of cellular responses. The 
identification and characterization of such a receptor 
enables one to identify both the ligands which bind to 
the receptor and the intracellular molecules and signal 
25 transduction pathways associated with the receptor, 
permitting one to identify or design modulators of 
receptor activity, e.g., receptor agonists or antagonists 
and modulators of signal transduction. 

Summary of the Invention 
30 The present invention is based, at least in part, 

on the discovery of a gene encoding T13 9. The T13 9 cDNA 
described below (SEQ ID N0:1) has a 1338 nucleotide open 
reading frame (nucleotides 95-1432 of SEQ ID N0:1; SEQ ID 
NO: 3) which encodes a 446 amino acid protein (SEQ ID 
35 NO: 2) . This protein includes a predicted signal sequence 
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of about 26 amino acids (from about amino acid 1 to about 
amino acid 26 of SEQ ID NO: 2) and a predicted mature 
protein of about 420 amino acids (from about amino acid 
27 to amino acid 446 of SEQ ID NO:2; SEQ ID NO: 4) . T139 
5 protein possesses a sperm-coating protein (SCP) domain 
(amino acids 47 to 190, of SEQ ID NO:2; SEQ ID NO:5), a 
C-type lectin domain (amino acids 297 to 412, of SEQ ID 
NO: 2; SEQ ID NO: 6), and two epidermal growth factor 
(EGF)-like domains (amino acids 232 to 260 of SEQ ID 

10 NO: 2; SEQ ID NO: 7, referred to herein as the "EGF1 

domain" and amino acids 264 to 291 of SEQ ID NO: 2; SEQ ID 
NO: 8, referred to herein as the "EGF2 domain"). 

The T139 molecules of the present invention are 
useful as modulating agents in regulating a variety of 

15 cellular processes. Accordingly, in one aspect, this 
invention provides isolated nucleic acid molecules 
encoding T139 proteins or biologically active portions 
thereof, as well as nucleic acid fragments suitable as 
primers or hybridization probes for the detection of 

20 T13 9 -encoding nucleic acids. 

The invention features a nucleic acid molecule 
which is at least 45% (or 55%, 65%, 75%, 85%, 95%, or 
98%) identical to the nucleotide sequence shown in SEQ ID 
NO:l, or SEQ ID NO: 3, or the nucleotide sequence of the 

25 cDNA insert of the plasmid deposited with ATCC as 
Accession Number (the "cDNA of ATCC 98694"), or a 
complement thereof. The invention features a nucleic 
acid molecule which includes a fragment of at least 300 
(325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 

30 800, 900, 1000, or 1290) nucleotides of the nucleotide 
sequence shown in SEQ ID N0:1, or SEQ ID NO: 3, or the 
nucleotide sequence of the cDNA of ATCC 98694, or a 
complement thereof . 

The invention also features a nucleic acid 

35 molecule which includes a nucleotide sequence encoding a 
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protein having an amino acid sequence that is at least 
45% (or 55%, 65%, 75%, 85%, 95%, or 98%) identical to the 
amino acid sequence of SEQ ID NO: 2, SEQ ID NO: 4, or the 
amino acid sequence encoded by the cDNA of ATCC 98694. 
5 In a preferred embodiment, a T139 nucleic acid molecule 
has the nucleotide sequence shown SEQ ID N0:1, or SEQ ID 
NO: 3, or the nucleotide sequence of the cDNA of ATCC 
98694. 

Also within the invention is a nucleic acid 
10 molecule which encodes a fragment of a polypeptide having 
the amino acid sequence of SEQ ID NO: 2 or SEQ ID NO:4, 
the fragment including at least 15 (25, 30, 50, 100, 150, 
300, or 400) contiguous amino acids of SEQ ID NO: 2 or SEQ 
ID NO: 4 or the polypeptide encoded by the cDNA of ATCC 
15 98694. 

The invention includes a nucleic acid molecule 
which encodes a naturally occurring allelic variant of a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or SEQ ID NO: 4 or an amino acid sequence encoded by 

20 the cDNA of ATCC 98694, wherein the nucleic acid molecule 
hybridizes to a nucleic acid molecule comprising SEQ ID 
NO:l or SEQ ID NO: 3 under stringent conditions. 

Also within the invention are: an isolated T139 
protein having an amino acid sequence that is at least 

25 about 65%, preferably 75%, 85%, 95%, or 98% identical to 
the amino acid sequence of SEQ ID NO: 4 (mature human 
T139) or the amino acid sequence of SEQ ID NO: 2 (immature 
human T139) ; and an isolated T139 protein having an amino 
acid sequence that is at least about 85%, 95%, or 98% 

30 identical to the SCP-like domain of SEQ ID NO:2 (e.g., 
about amino acid residues 47 to 190 of SEQ ID NO:2) , C- 
type lectin domain (e.g., about amino acid residues 297 
to 412 of SEQ ID NO:2) , and EGF-like domains (e.g., about 
amino acids residues 232 to 260 or 264 to 291 of SEQ ID 

35 NO:2) . 
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300 (325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 
700, 800, 900, 1000, or 1290) nucleotides in length and 
hybridizes under stringent conditions to a nucleic acid 
molecule comprising the nucleotide sequence shown in SEQ 
5 ID NO:l, SEQ ID NO: 3, the cDNA of ATCC 98694, or a 
complement thereof. In a preferred embodiment, an 
isolated T139 nucleic acid molecule comprises nucleotides 
233 to 665 of SEQ ID NO:l, encoding the SCP-like domain 
of T139; nucleotides 983 to 1330 of SEQ ID NO:l, encoding 

10 the C-type lectin domain of T139; or nucleotides 788 to 
874 or 884 to 967 of SEQ ID NO:l, encoding a EGF like 
domain of T139, or a complement thereof. In another 
embodiment, the invention provides an isolated nucleic 
acid molecule which is antisense to the coding strand of 

15 a T139 nucleic acid. 

Another aspect of the invention provides a vector, 
e.g. , a recombinant expression vector, comprising a T13 9 
nucleic acid molecule of the invention. In another 
embodiment the invention provides a host cell containing 

20 such a vector. The invention also provides a method for 
producing T139 protein by culturing, in a suitable 
medium, a host cell of the invention containing a 
recombinant expression vector such that a T13 9 protein is 
produced . 

25 Another aspect of this invention features isolated 

or recombinant T139 proteins and polypeptides. Preferred 
T139 proteins and polypeptides possess at least one 
biological activity possessed by naturally occurring 
human T139, e.g., (1) the ability to form protein: protein 

30 interactions with proteins; and (2) the ability to bind a 
T139 ligand. Other activities include: (1) modulation of 
cellular proliferation and (2) modulation of cellular 
differentiation . 

The T139 proteins of the present invention, or 

35 biologically active portions thereof, can be operatively 
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linked to a non-T139 polypeptide (e.g., heterologous 
amino acid sequences) to form T13 9 fusion proteins. The 
invention further features antibodies that specifically 
bind T139 proteins, such as monoclonal or polyclonal 
5 antibodies. In addition, the T139 proteins or 

biologically active portions thereof can be incorporated 
into pharmaceutical compositions, which optionally 
include pharmaceutical ly acceptable carriers. 

In another aspect, the present invention provides 

10 a method for detecting the presence of T139 activity or 
expression in a biological sample by contacting the 
biological sample with an agent capable of detecting an 
indicator of T139 activity such that the presence of T139 
activity is detected in the biological sample. 

15 In another aspect, the invention provides a method 

for modulating T139 activity comprising contacting a cell 
with an agent that modulates (inhibits or stimulates) 
T139 activity or expression such that T139 activity or 
expression in the cell is modulated. In one embodiment, 

20 the agent is an antibody that specifically binds to T139 
protein. In another embodiment, the agent modulates 
expression of T13 9 by modulating transcription of a T139 
gene, splicing of a T139 mRNA, or translation of a T139 
mRNA. In yet another embodiment, the agent is a nucleic 

25 acid molecule having a nucleotide sequence that is 

antisense to the coding strand of the T139 mRNA or the 
T139 gene. 

In one embodiment, the methods of the present 
invention are used to treat a subject having a disorder 

30 characterized by aberrant T139 expression or activity by 
administering an agent which is a T13 9 modulator to the 
subject. In one embodiment, the T139 modulator is a T139 
protein. In another embodiment the T139 modulator is a 
T139 nucleic acid molecule. In other embodiments, the 

35 T139 modulator is a peptide, peptidomimetic, or other 
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small molecule. In a preferred embodiment, the disorder 
characterized by aberrant T139 protein or nucleic acid 
expression is a proliferative or dif f erentiative 
disorder, particularly of the immune system. 
5 T139 nucleic acids, polypeptides, and modulators 

of T13 9 expression or activity can be used to modulate 
spermatogenesis. For example, such molecules can be used 
as a contraceptive to decrease spermatogenesis . 
Alternatively, such molecules can be used to increase 

10 spermatogenesis. Such molecules may also be useful for 
treatment of disorders related to defects in sperm-egg 
fusion and disorders of acrosome formation or function. 
T139 nucleic acids, polypeptides, and modulators may also 
be useful for treatment of various testicular disorders, 

15 e.g., testicular cancer. T139 may serve as an antigen in 
a contraceptive vaccinogen. 

The present invention also provides a diagnostic 
assay for identifying the presence or absence of a 
genetic lesion or mutation characterized by at least one 

20 of: (i) aberrant modification or mutation of a gene 
encoding a T139 protein; (ii) mis-regulation of a gene 
encoding a T139 protein; and (iii) aberrant post- 
translational modification of a T139 protein, wherein a 
wild- type form of the gene encodes a protein with a T139 

25 activity. 

In another aspect, the invention provides a 
method for identifying a compound that binds to or 
modulates the activity of a T139 protein. In general, 
such methods entail measuring a biological activity of a 
30 T139 protein in the presence and absence of a test 

compound and identifying those compounds which alter the 
activity of the T139 protein. 

The invention also features methods for 
identifying a compound which modulates the expression of 
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T139 by measuring the expression of T139 in the presence 
and absence of a compound. 

Other features and advantages of the invention 
will be apparent from the following detailed description 
5 and claims. 

Brief Description of the Drawings 
Figure 1 depicts the cDNA sequence (SEQ ID N0:1) 
and predicted amino acid sequence (SEQ ID NO: 2) of human 
T139 (also referred to as "TANGO 139") . The open reading 

10 frame of SEQ ID N0:1 extends from nucleotide 95 to 
nucleotide 1432 (SEQ ID N0:3). 

Figures 2A-2C depict alignments of portions the 
amino acid sequences of T139 with various consensus 
sequences. Figure 2A shows the alignment of T139 amino 

15 acids 47 to 190 of SEQ ID NO: 2 with the SCP-like domain 
consensus sequence derived from a hidden Markov model 
(PF00188; SEQ ID NO: 9) . Figure 2B shows the alignment of 
T139 amino acids 297 to 412 of SEQ ID NO: 2 with the C- 
type lectin domain consensus sequence derived from a 

20 hidden Markov model (PF00059; SEQ ID NO:10) . Figure 2C 
shows the alignment of T139 amino acids 232 to 260 (EGF1) 
and 264 to 291 (EGF2) of SEQ ID NO: 2 with the EGF-like 
domain consensus sequence derived from a hidden Markov 
model (PF00008; SEQ ID NO:ll). 

25 Figure 3 is a hydropathy plot of T13 9. The 

position of cysteines (cys) are indicated by the vertical 
bars immediately below the plot. Relative hydrophobicity 
is shown above the dotted line, and relative 
hydrophilicity is shown below the line. 

3 0 Detailed Description of the Invention 

The present invention is based on the discovery of 

a cDNA molecule encoding human T139. 

A nucleotide sequence encoding a human T139 

protein- is shown in Figure 1 (SEQ ID NO:l; SEQ ID NO: 3 
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includes the open reading frame only) . A predicted amino 
acid sequence of T139 protein is also shown in Figure 1 
(SEQ ID NO: 2) . 

The T139 cDNA of Figure 1 (SEQ ID N0:1), which is 
5 approximately 1856 nucleotides long, including 

untranslated regions, encodes a protein amino acid having 
a molecular weight of approximately 49 kDa (excluding 
post-translational modifications) . A plasmid containing 
a cDNA encoding human T139 was deposited with American 

10 Type Culture Collection (ATCC) , Rockville, Maryland on 
March 12, 1998, and assigned Accession Number 98694. 
This deposit will be maintained under the terms of the 
Budapest Treaty on the International Recognition of the 
Deposit of Microorganisms for the Purposes of Patent 

15 Procedure. This deposit was made merely as a convenience 
for those of skill in the art and is not an admission 
that a deposit is required under 35 U.S.C. §112. 

Sequence analysis revealed that T139 is homologous 
to testis-specif ic protein-1 (TPX-1) , a member of the 

2 0 SCP-like domain protein family. Comparison of the T139 

SCP-like domain with the SCP-like domain consensus (SEQ 
ID NO: 9) revealed that the T139 SCP-like domain is 28% 
identical (45/162 amino acids) and 50% similar (81/162 
amino acids) to the consensus. 
25 Alignment of the C-type lectin domain of human 

T139 protein (SEQ ID NO: 6) with the C-type lectin domain 
consensus sequence (SEQ ID NO: 10) revealed that the 
domains are 27% identical (28/103 amino acids) and 63% 
similar (65/103 amino acids) . C-type lectin domains 

3 0 appear to function as calcium-dependent carbohydrate- 

recognition domains and contain four conserved cysteines. 
The first and fourth cysteines and the second and third 
cysteines in the consensus participate in disulfide 
bonding with each other. One example of a protein having 
35 a C-type lectin domain is the REG protein, a 166 amino 
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acid polypeptide shown to stimulate beta-cell 
regeneration in a adult mouse pancreas. For a review on 
the REG protein, see Baeza et al. (1996) Diab. Metab. 
22:229-234. 

5 Alignment of the EGF-like domains of human T139 

protein (SEQ ID NOS:7 and 8) with the EGF-like domain 
consensus sequence (SEQ ID NO: 11) revealed that the EGF1 
domain is 38% identical (13/34 amino acids) and 71% 
similar (24/34 amino acids) . In general, EGF-like 

10 domains are found in the extracellular portion of 

membrane -bound proteins or in secreted proteins. EGF- 
like domains typically include six cysteine residues 
involved in disulfide bond formation with two conserved 
glycines between the fifth and sixth cysteine. The 

15 secondary structure of EGF-like domains appears to be a 
two- stranded £- sheet followed by a loop to a C- terminal 
short two-stranded sheet. 

Tango 139 is expressed at high levels in the 
kidney and at low levels in the testis as an about 2.0 kb 

20 transcript. Additional T139 transcripts of about 2.4 kb 
and 3.5 kb were also present in these two tissues. No 
T139 expression was observed in the heart, brain, 
placenta, lung, liver, skeletal muscle, pancreas, spleen, 
thymus, ovaries, small intestine, colon, and peripheral 

25 blood leukocytes. 

Human T139 is one member of a family of molecules 
(the "T139 family") having certain conserved structural 
and functional features. The term "family" when 
referring to the protein and nucleic acid molecules of 

30 the invention is intended to mean two or more proteins or 
nucleic acid molecules having a common structural domain 
and having sufficient amino acid or nucleotide sequence 
identity as defined herein. Such family members can be 
naturally occurring and can be from either the same or 

35 different species. For example, a family can contain a 
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first protein of human origin and a homologue of that 
protein of murine origin, as well as a second, distinct 
protein of human origin and a murine homologue of that 
protein. Members of a family may also have common 
5 functional characteristics. 

In one embodiment, a T139 protein includes a SCP- 
like, C-type lectin, or EGF-like domain having at least 
about 65%, preferably at least about 75%, and more 
preferably about 85%, 95%, or 98% amino acid sequence 

10 identity to the SCP-like domain of SEQ ID NO: 5, C-type 
lectin domain of SEQ ID NO:6, or EGF-like domain of SEQ 
ID N0S:7 or 8. 

Preferred T139 polypeptides of the present 
invention have an amino acid sequence sufficiently 

15 identical to the SCP-like domain of SEQ ID NO: 5, C-type 
lectin domain of SEQ ID NO: 6, or EGF-like domain of SEQ 
ID NOS:7 or 8. As used herein, the term "sufficiently 
identical" refers to a first amino acid or nucleotide 
sequence which contains a sufficient or minimum number of 

20 identical or equivalent (e.g., an amino acid residue 
which has a similar side chain) amino acid residues or 
nucleotides to a second amino acid or nucleotide sequence 
such that the first and second amino acid or nucleotide 
sequences have a common structural domain and/or common 

25 functional activity. For example, amino acid or 

nucleotide sequences which contain a common structural 
domain having about 65% identity, preferably 75% 
identity, more preferably 85%, 95%, or 98% identity are 
defined herein as sufficiently identical. 

30 As used interchangeably herein a "T139 activity", 

"biological activity of T139" or "functional activity of 
T139", refers to an activity exerted by a T139 protein, 
polypeptide or nucleic acid molecule on a T13 9 responsive 
cell as determined in vivo, or in vitro, according to 

35 standard techniques. A T139 activity can be a direct 
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activity, such as an association with or an enzymatic 
activity on a second protein or an indirect activity, 
such as a cellular signaling activity mediated by 
interaction of the T139 protein with a second protein. 
5 In a preferred embodiment, a T13 9 activity includes at 
least one or more of the following activities: (i) 
interaction with other proteins; and (ii) interaction 
with a T13 9 receptor. 

Accordingly, another embodiment of the invention 

10 features isolated T139 proteins and polypeptides having a 
T13 9 activity. 

Yet another embodiment of the invention features 
T13 9 molecules which contain a signal sequence. 
Generally, a signal sequence (or signal peptide) is a 

15 peptide containing about 20 amino acids which occurs at 
the extreme N-terminal end of secretory and integral 
membrane proteins and which contains large numbers of 
hydrophobic amino acid residues and serves to direct a 
protein containing such a sequence to a lipid bilayer. 

20 Various aspects of the invention are described in 

further detail in the following subsections. 

I. Isolated Nucleic Acid Molecules 

One aspect of the invention pertains to isolated 
nucleic acid molecules that encode T139 proteins or 

25 biologically active portions thereof, as well as nucleic 
acid molecules sufficient for use as hybridization probes 
to identify T139-encoding nucleic acids (e.g., T139 mRNA) 
and fragments for use as PCR primers for the 
amplification or mutation of T139. nucleic acid molecules. 

30 As used herein, the term "nucleic acid molecule" is 

intended to include DNA molecules (e.g., cDNA or genomic 
DNA) and RNA molecules (e.g., mRNA) and analogs of the 
DNA or RNA generated using nucleotide analogs. The 
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nucleic acid molecule can be single-stranded or double- 
stranded, but preferably is double-stranded DNA. 

An "isolated" nucleic acid molecule is one which 
is separated from other nucleic acid molecules which are 
5 present in the natural source of the nucleic acid. 
Preferably, an "isolated" nucleic acid is free of 
sequences (preferably protein encoding sequences) which 
naturally flank the nucleic acid (i.e., sequences located 
at the 5' and 3' ends of the nucleic acid) in the genomic 

10 DNA of the organism from which the nucleic acid is 
derived. For example, in various embodiments, the 
isolated T139 nucleic acid molecule can contain less than 
about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of 
nucleotide sequences which naturally flank the nucleic 

15 acid molecule in genomic DNA of the cell from which the 
nucleic acid is derived. Moreover, an "isolated" nucleic 
acid molecule, such as a cDNA molecule, can be 
substantially free of other cellular material, or culture 
medium when produced by recombinant techniques, or 

2 0 substantially free of chemical precursors or other 
chemicals when chemically synthesized. 

A nucleic acid molecule of the present invention, 
e.g., a nucleic acid molecule having the nucleotide 
sequence of SEQ ID NO:l, SEQ ID NO: 3, or the cDNA of ATCC 

25 98694, or a complement of any of these nucleotide 
sequences, can be isolated using standard molecular 
biology techniques and the sequence information provided 
herein. Using all or portion of the nucleic acid 
sequences of SEQ ID N0:1, SEQ ID NO: 3, or the cDNA of 

30 ATCC 98694 as a hybridization probe, T139 nucleic acid 
molecules can be isolated using standard hybridization 
and cloning techniques (e.g., as described in Sambrook et 
al., eds.. Molecular Cloning: A Laboratory Manual . 2nd, 
ed., Cold Spring Harbor Laboratory, Cold Spring Harbor 

35 Laboratory Press, Cold Spring Harbor, NY, 1989) . 
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A nucleic acid of the invention can be amplified 
using cDNA, mRNA or genomic DNA as a template and 
appropriate oligonucleotide primers according to standard 
PCR amplification techniques. The nucleic acid so 
5 amplified can be cloned into an appropriate vector and 
characterized by DNA sequence analysis. Furthermore, 
oligonucleotides corresponding to T139 nucleotide 
sequences can be prepared by standard synthetic 
techniques, e.g., using an automated DNA synthesizer. 

10 In another preferred embodiment, an isolated 

nucleic acid molecule of the invention comprises a 
nucleic acid molecule which is a complement of the 
nucleotide sequence shown in SEQ ID NO:l, SEQ ID NO: 3, or 
the cDNA of ATCC 98694, or a portion thereof. A nucleic 

15 acid molecule which is complementary to a given 
nucleotide sequence is one which is sufficiently 
complementary to the given nucleotide sequence that it 
can hybridize to the given nucleotide sequence thereby 
forming a stable duplex. 

20 Moreover, the nucleic acid molecule of the 

invention can comprise only a portion of a nucleic acid 
sequence encoding T139, for example, a fragment which can 
be used as a probe or primer or a fragment encoding a 
biologically active portion of T139. The nucleotide 

25 sequence determined from the cloning of the human T139 
gene allows for the generation of probes and primers 
designed for use in identifying and/or cloning T139 
homologues in other cell types, e.g., from other tissues, 
as well as T13 9 homologues from other mammals. The 

30 probe/primer typically comprises substantially purified 
oligonucleotide. The oligonucleotide typically comprises 
a region of nucleotide sequence that hybridizes under 
stringent conditions to at least about 12, preferably 
about 25, more preferably about 50, 75, 100, 125, 150, 

35 175, 200, 250, 300, 350 or 400 consecutive nucleotides of 
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the sense or ant i- sense sequence of SEQ ID N0:1, SEQ ID 
NO:3, or the cDNA of ATCC 98694 or of a naturally 
occurring mutant of SEQ ID N0:1, SEQ ID NO: 3, or the cDNA 
of ATCC 98694. 

5 Probes based on the human T139 nucleotide sequence 

can be used to detect transcripts or genomic sequences 
encoding the same or identical proteins. The probe 
comprises a label group attached thereto, e.g., a 
radioisotope/ a fluorescent compound, an enzyme f or an 

10 enzyme co-factor. Such probes can be used as a part of a 
diagnostic test kit for identifying cells or tissue which 
mis-express a T139 protein, such as by measuring a level 
of a T139-encoding nucleic acid in a sample of cells from 
a subject, e.g., detecting T139 mRNA levels or 

15 determining whether a genomic T139 gene has been mutated 
or deleted. 

A nucleic acid fragment encoding a "biologically 
active portion of T139" can be prepared by isolating a 
portion of SEQ ID NO:l, SEQ ID NO: 3, or the nucleotide 

20 sequence of the cDNA of ATCC 98694 which encodes a 

polypeptide having a T139 biological activity, expressing 
the encoded portion of T139 protein (e.g., by recombinant 
expression in vitro) and assessing the activity of the 
encoded portion of T139. For example, a nucleic acid 

25 fragment encoding a biologically active portion of T139 
includes a SCP-like, C-type lectin, or EGF-like domain, 
e.g., any of SEQ ID NOS:5-8. 

The invention further encompasses nucleic acid 
molecules that differ from the nucleotide sequence of SEQ 

30 ID NO:l, SEQ ID NO: 3, or the cDNA of ATCC 98694 due to 
degeneracy of the genetic code and thus encode the same 
T139 protein as that encoded by the nucleotide sequence 
shown in SEQ ID NO:l, SEQ ID NO: 3, or the cDNA of ATCC 
98694. 
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In addition to the human T139 nucleotide sequence 
shown in SEQ ID N0:1, SEQ ID NO: 3 , or the cDNA of ATCC 
98694, it will be appreciated by those skilled in the art 
that DNA sequence polymorphisms that lead to changes in 
5 the amino acid sequences of T13 9 may exist within a 
population (e.g., the human population). Such genetic 
polymorphism in the T139 gene may exist among individuals 
within a population due to natural allelic variation. As 
used herein, the terms "gene" and "recombinant gene" 

10 refer to nucleic acid molecules comprising an open 
reading frame encoding a T139 protein, preferably a 
mammalian T139 protein. Such natural allelic variations 
can typically result in 1-5% variance in the nucleotide 
sequence of the T139 gene. Any and all such nucleotide 

15 variations and resulting amino acid polymorphisms in T139 
that are the result of natural allelic variation and that 
do not alter the functional activity of T13 9 are intended 
to be within the scope of the invention. 

Moreover, nucleic acid molecules encoding T139 

20 proteins from other species (T139 homologues) , which have 
a nucleotide sequence which differs from that of a human 
T13 9, are intended to be within the scope of the 
invention. Nucleic acid molecules corresponding to 
natural allelic variants and homologues of the T13 9 cDNA 

25 of the invention can be isolated based on their identity 
to the human T13 9 nucleic acids disclosed herein using 
the human cDNAs, or a portion thereof, as a hybridization 
probe according to standard hybridization techniques 
under stringent hybridization conditions. For example, a 

3 0 membrane -bound human T139 cDNA can be isolated based on 
its identity to soluble T139. 

Accordingly, in another embodiment, an isolated 
nucleic acid molecule of the invention is at least 300 
(325, 350, 375, 400, 425, 450, 500, 550, 600, 650, 700, 

35 800, 900, 1000, or 1290) nucleotides in length and 
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hybridizes under stringent conditions to the nucleic acid 
molecule comprising the nucleotide sequence, preferably 
the coding sequence, of SEQ ID N0:1, SEQ ID NO: 3, or the 
cDNA of ATCC 98694. 
5 As used herein, the term "hybridizes under 

stringent conditions" is intended to describe conditions 
for hybridization and washing under which nucleotide 
sequences at least 60% {65%, 70%, preferably 75%) 
identical to each other typically remain hybridized to 

10 each other. Such stringent conditions are known to those 
skilled in the art and can be found in Current Protocols 
in Molecular Biology, John Wiley & Sons, N. Y. (1989) , 
6,3.1-6,3.6. A preferred, non-limiting example of 
stringent hybridization conditions are hybridization in 

15 6X sodium chloride/sodium citrate (SSC) at about 45 °C, 
followed by one or more washes in 0.2 X SSC, 0.1% SDS at 
50-65°C. Preferably, an isolated nucleic acid molecule 
of the invention that hybridizes under stringent 
conditions to the sequence of SEQ ID NO:l, SEQ ID NO:3, 

20 the cDNA of ATCC 98694 corresponds to a naturally- 
occurring nucleic acid molecule. As used herein, a 
"naturally-occurring" nucleic acid molecule refers to an 
RNA or DNA molecule having a nucleotide sequence that 
occurs in nature (e.g., encodes a natural protein). 

25 In addition to naturally-occurring allelic 

variants of the T139 sequence that may exist in the 
population, the skilled artisan will further appreciate 
that changes can be introduced by mutation into the 
nucleotide sequence of SEQ ID N0:1, SEQ ID NO: 3, the cDNA 

30 of ATCC 98694, thereby leading to changes in the amino 
acid sequence of the encoded T13 9 protein, without 
altering the functional ability of the T139 protein. For 
example, one can make nucleotide substitutions leading to 
amino acid substitutions at "non-essential" amino acid 

35 residues. A "non-essential" amino acid residue is a 
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residue that can be altered from the wild- type sequence 
of T139 (e.g., the sequence of SEQ ID N0:2) without 
altering the biological activity, whereas an "essential" 
amino acid residue is required for biological activity. 
5 For example, amino acid residues that are conserved among 
the T139 proteins of various species are predicted to be 
particularly unamenable to alteration. 

For example, preferred T139 proteins of the 
present invention, contain at least one SCP-like, C-type 

10 lectin, or EGF-like domain. Such conserved domains are 
less likely to be amenable to mutation. Other amino acid 
residues, however, (e.g., those that are not conserved or 
only semi -conserved among T139 of various species) may 
not be essential for activity and thus are likely to be 

15 amenable to alteration. 

Accordingly, another aspect of the invention 
pertains to nucleic acid molecules encoding T139 proteins 
that contain changes in amino acid residues that are not 
essential for activity. Such T13 9 proteins differ in 

2 0 amino acid sequence from SEQ ID NO: 2 yet retain 

biological activity. In one embodiment, the isolated 
nucleic acid molecule includes a nucleotide sequence 
encoding a protein that includes an amino acid sequence 
that is at least about 45% identical, 65%, 75%, 85%, 95%, 

25 or 98% identical to the amino acid sequence of SEQ ID 
NO:2. 

An isolated nucleic acid molecule encoding a T139 
protein having a sequence which differs from that of SEQ 
ID NO: 2 can be created by introducing one or more 

30 nucleotide substitutions, additions or deletions into the 
nucleotide sequence of SEQ ID N0:1, SEQ ID NO: 3, the cDNA 
of ATCC 98694 such that one or more amino acid 
substitutions, additions or deletions are introduced into 
the encoded protein. Mutations can be introduced by 

35 standard techniques, such as^ site-directed mutagenesis 
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and PCR-mediated mutagenesis. Preferably, conservative 
amino acid substitutions are made at one or more 
predicted non-essential amino acid residues. A 
"conservative amino acid substitution" is one in which 
5 the amino acid residue is replaced with an amino acid 
residue having a similar side chain. Families of amino 
acid residues having similar side chains have been 
defined in the art. These families include amino acids 
with basic side chains (e.g., lysine, arginine, 

10 histidine) , acidic side chains (e.g., aspartic acid, 
glutamic acid), uncharged polar side chains (e.g.,* 
glycine , asparagine , glutamine , serine , threonine , 
tyrosine, cysteine), nonpolar side chains (e.g., alanine, 
valine , leucine , isoleucine , proline , phenylalanine , 

15 methionine, tryptophan), beta-branched side chains (e.g., 
threonine, valine, isoleucine) and aromatic side chains 
(e.g., tyrosine, phenylalanine, tryptophan, histidine). 
Thus, a predicted nonessential amino acid residue in T139 
is preferably replaced with another amino acid residue 

20 from the same side chain family. Alternatively, 

mutations can be introduced randomly along all or part of 
a T139 coding sequence, such as by saturation 
mutagenesis, and the resultant mutants can be screened 
for T139 biological activity to identify mutants that 

25 retain activity. Following mutagenesis, the encoded 

protein can be expressed recombinantly and the activity 
of the protein can be determined. 

In a preferred embodiment, a mutant T13 9 protein 
can be assayed for: (1) the ability to form 

30 protein: protein interactions with proteins; or (2) the 
ability to bind a T139 receptor. In yet another 
preferred embodiment, a mutant T139 can be assayed for 
the ability to modulate cellular proliferation or 
cellular differentiation. 
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The present invention encompasses antisense 
nucleic acid molecules, i.e., molecules which are 
complementary to a sense nucleic acid encoding a protein, 
e.g., complementary to the coding strand of a double- 
5 stranded cDNA molecule or complementary to an mRNA 
sequence. Accordingly, an antisense nucleic acid can 
hydrogen bond to a sense nucleic acid. The antisense 
nucleic acid can be complementary to an entire T139 
coding strand, or to only a portion thereof, e.g., all or 

10 part of the protein coding region (or open reading 
frame) . An antisense nucleic acid molecule can be 
antisense to a noncoding region of the coding strand of a 
nucleotide sequence encoding T139. The noncoding regions 
("5' and 3' untranslated regions") are the 5' and 3' 

15 sequences which flank the coding region and are not 
translated into amino acids. 

Given the coding strand sequences encoding T139 
disclosed herein (e.g., SEQ ID NO:l or SEQ ID NO:3) , 
antisense nucleic acids of the invention can be designed 

20 according to the rules of Watson and Crick base pairing. 
The antisense nucleic acid molecule can be complementary 
to the entire coding region of T139 mRNA, but more 
preferably is an oligonucleotide which is antisense to 
only a portion of the coding or noncoding region of T139 

25 mRNA. For example, the antisense oligonucleotide can be 
complementary to the region surrounding the translation 
start site of T139 mRNA, e.g., an oligonucleotide having 
the sequence CTCTGGATGCAGCATGGGTCTGTTGGGCCC (SEQ ID 
NO: 12) or GATGCAGCATGGGTCTGTTG (SEQ ID NO: 13). An 

30 antisense oligonucleotide can be, for example, about 5, 
10, 15, 20, 25, 30, 35, 40, 45 or 50 nucleotides in 
length. An antisense nucleic acid of the invention can 
be constructed using chemical synthesis and enzymatic 
ligation reactions using procedures known in the art. 

35 For example, an antisense nucleic acid (e.g., an 
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antisense oligonucleotide) can be chemically synthesized 
using naturally occurring nucleotides or variously 
modified nucleotides designed to increase the biological 
stability of the molecules or to increase the physical 
5 stability of the duplex formed between the antisense and 
sense nucleic acids, e.g., phosphorothioate derivatives 
and acridine substituted nucleotides can be used. 
Examples of modified nucleotides which can be used to 
generate the antisense nucleic acid include 5- 

10 f luorouracil, 5-bromouracil, 5-chlorouracil, 5- 

iodouracil, hypoxanthine , xanthine, 4- acetyl cytosine, 5- 
( carboxyhydroxylmethyl ) uracil , 5 - 
carboxymethylaminomethyl -2 -thiouridine , 5- 
carboxymethylaminomethyluracil , dihydrouracil , beta-D- 

15 galactosylqueosine, inosine, N6-isopentenyl adenine, 1- 
me thylguanine , 1 -methylinosine , 2 , 2 -dimethyl guanine , 2 - 
methyl adenine, 2-methylguanine, 3 -methyl cytosine, 5- 
methylcytosine, N6- adenine, 7 -me thylguanine, 5- 
methylaminomethyl uracil , 5 -methoxyaminomethyl - 2 - 

20 thiouracil, beta-D-mannosylqueosine, 5'- 

methoxycarboxymethyl uracil , 5 -methoxyuracil , 2 - 
methyl thio-N6-isopentenyladenine, uracil- 5 -oxyacetic acid 
(v) , wybutoxosine, pseudouracil, queosine, 2- 
thiocytosine , 5-methyl -2 - thiouracil , 2 - thiouracil , 4 - 

25 thiouracil, 5-methyluracil, uracil -5 -oxyacetic acid 
methylester, uracil-5-oxyacetic acid (v) , 5-methyl-2- 
thiouracil, 3- (3-amino-3-N-2-carboxypropyl) uracil, 
(acp3)w, and 2 , 6-diaminopurine . Alternatively, the 
antisense nucleic acid can be produced biologically using 

30 an expression vector into which a nucleic acid has been 
subcloned in an antisense orientation (i.e., RNA 
transcribed from the inserted nucleic acid will be of an 
antisense orientation to a target nucleic acid of 
interest, described further in the following subsection) . 
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The antisense nucleic acid molecules of the 
invention are typically administered to a subject or 
generated in situ such that they hybridize with or bind 
to cellular mRNA and/or genomic DNA encoding a T139 
5 protein to thereby inhibit expression of the protein, 
e.g., by inhibiting transcription and/or translation. 
The hybridization can be by conventional nucleotide 
complementarity to form a stable duplex, or, for example, 
in the case of an antisense nucleic acid molecule which 

10 binds to DNA duplexes, through specific interactions in 
the major groove of the double helix. An example of a 
route of administration of antisense nucleic acid 
molecules of the invention include direct injection at a 
tissue site. Alternatively, antisense nucleic acid 

15 molecules can be modified to target selected cells and 
then administered systemically . For example, for 
systemic administration, antisense molecules can be 
modified such that they specifically bind to receptors or 
antigens expressed on a selected cell surface, e.g., by 

20 linking the antisense nucleic acid molecules to peptides 
or antibodies which bind to cell surface receptors or 
antigens. The antisense nucleic acid molecules can also 
be delivered to cells using the vectors described herein. 
To achieve sufficient intracellular concentrations of the 

25 antisense molecules, vector constructs in which the 
antisense nucleic acid molecule is placed under the 
control of a strong pol II or pol III promoter are 
preferred. 

An antisense nucleic acid molecule of the 
30 invention can be an a-anomeric nucleic acid molecule. An 
of-anomeric nucleic acid molecule forms specific double- 
stranded hybrids with complementary RNA in which, 
contrary to the usual /? -units, the strands run parallel 
to each other (Gaultier et al. (1987) Nucleic Acids. Res. 
35 15:6625-6641). The antisense nucleic acid molecule can 
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also comprise a 2 ' -o-methylribonucleotide (Inoue et al. 
(1987) Nucleic Acids Res. 15:6131-6148) or a chimeric 
RNA-DNA analogue (Inoue et al . (1987) FEBS Lett. 215:327- 
330) . 

5 The invention also encompasses ribozymes. 

Ribozymes are catalytic RNA molecules with ribonuclease 
activity which are capable of cleaving a single- stranded 
nucleic acid, such as an mRNA, to which they have a 
complementary region. Thus, ribozymes (e.g., hammerhead 

10 ribozymes (described in Haselhoff and Gerlach (1988) 

Nature 334:585-591)) can be used to catalytically- cleave 
T139 mRNA transcripts to thereby inhibit translation of 
T139 mRNA. A ribozyme having specificity for a T139- 
encoding nucleic acid can be designed based upon the 

15 nucleotide sequence of a T139 cDNA disclosed herein 
(e.g., SEQ ID NO:l, SEQ ID N0:3) . For example, a 
derivative of a Tetrahymena L-19 IVS RNA can be 
constructed in which the nucleotide sequence of the 
active site is complementary to the nucleotide sequence 

20 to be cleaved in a T13 9 -encoding mRNA. See, e.g., Cech 
et al. U.S. Patent No. 4,987,071; and Cech et al . U.S. 
Patent No. 5,116,742. Alternatively, T139 mRNA can be 
used to select a catalytic RNA having a specific 
ribonuclease activity from a pool of RNA molecules. See, 

25 e.g., Bartel and Szostak (1993) Science 261:1411-1418. 

The invention also encompasses nucleic acid 
molecules which form triple helical structures. For 
example, T139 gene expression can be inhibited by 
targeting nucleotide sequences complementary to the 

30 regulatory region of the T139 (e.g., the T139 promoter 
and/or enhancers) to form triple helical structures that 
prevent transcription of the T139 gene in target cells. 
See generally, Helene (1991) Anticancer Drug Des. 
6(6):569-84; Helene (1992) Ann. N. Y. Acad. Sci. 660:27- 

35 36; and Maher (1992) Bioassays 14 (12) : 807-15 . 
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In preferred embodiments, the nucleic acid 
molecules of the invention can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, 
e.g., the stability, hybridization, or solubility of the 
5 molecule. For example, the deoxyribose phosphate 

backbone of the nucleic acids can be modified to generate 
peptide nucleic acids (see Hyrup et al . (1996) Bioorganic 
& Medicinal Chemistry 4 (1) :5-23) . As used herein, the 
terms "peptide nucleic acids" or "PNAs" refer to nucleic 

10 acid mimics, e.g., DNA mimics, in which the deoxyribose 
phosphate backbone is replaced by a pseudopeptide - 
backbone and only the four natural nucleobases are 
retained. The neutral backbone of PNAs has been shown to 
allow for specific hybridization to DNA and RNA under 

15 conditions of low ionic strength. The synthesis of PNA 
oligomers can be performed using standard solid phase 
peptide synthesis protocols as described in Hyrup et al. 
(1996) supra; Perry-O' Keef e et al. (1996) Proc. Natl. 
Acad. Sci. USA 93:14670-675. 

20 PNAs of T13 9 can be used therapeutic and 

diagnostic applications. For example, PNAs can be used 
as antisense or antigene agents for sequence-specific 
modulation of gene expression by, e.g., inducing 
transcription or translation arrest or inhibiting 

25 replication. PNAs of T139 can also be used, e.g., in the 
analysis of single base pair mutations in a gene by, 
e.g., PNA directed PCR clamping; as 'artificial 
restriction enzymes when used in combination with other 
enzymes, e.g., SI nucleases (Hyrup (1996) supra; or as 

3 0 probes or primers for DNA sequence and hybridization 
(Hyrup (1996) supra; Perry-O' Keef e et al . (1996) Proc. 
Natl. Acad. Sci. USA 93:14670-675). 

In another embodiment, PNAs of T139 can be 
modified, e.g., to enhance their stability or cellular 

35 uptake, by attaching lipophilic or other helper groups to 
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PNA, by the formation of PNA-DNA chimeras, or by the use 
of liposomes or other techniques of drug delivery known 
in the art. For example, PNA-DNA chimeras of T139 can be 
generated which may combine the advantageous properties 
5 of PNA and DNA. Such chimeras allow DNA recognition 
enzymes, e.g., RNAse H and DNA polymerases, to interact 
with the DNA portion while the PNA portion would provide 
high binding affinity and specificity. PNA-DNA chimeras 
can be linked using linkers of appropriate lengths 

10 selected in terms of base stacking, number of bonds 
between the nucleobases, and orientation (Hyrup«~ (1996) 
supra) . The synthesis of PNA-DNA chimeras can be 
performed as described in Hyrup (1996) supra and Finn et 
al. (1996) Nucleic Acids Research 24 (17) : 3357-63. For 

15 example, a DNA chain can be synthesized on a solid 

support using standard phosphoramidite coupling chemistry 
and modified nucleoside analogs, e.g., 5' -(4- 
methoxytrityl) amino-5' -deoxy- thymidine phosphoramidite, 
can be used as a between the PNA and the 5' end of DNA 

20 (Mag et al . (1989) Nucleic Acid Res. 17:5973-88). PNA 
monomers are then coupled in a stepwise manner to produce 
a chimeric molecule with a 5' PNA segment and a 3' DNA 
segment (Finn et al . (1996) Nucleic Acids Research 
24 (17) :3357-63) . Alternatively, chimeric molecules can 

25 be synthesized with a 5' DNA segment and a 3' PNA segment 
(Peterser et al. (1975) Bioorganic Med. Chem. Lett. 
5:1119-11124) . 

In other embodiments, the oligonucleotide may 
include other appended groups such as peptides (e.g., for 

30 targeting host cell receptors in vivo) , or agents 

facilitating transport across the cell membrane (see, 
e.g., Letsinger et al . (198$) Proc. Natl. Acad. Sci. USA 
86:6553-6556; Lemaitre et al . (1987) Proc. Natl. Acad. 
Sci. USA 84:648-652; PCT Publication No. W088/09810) or 

35 the blood-brain barrier (see, e.g., PCT Publication No. 
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W089/10134) . In addition, oligonucleotides can be 
modified with hybridization-triggered cleavage agents 
(see, e.g., Krol et al. (1988) Bio/Techniques 6:958-976) 
or intercalating agents (see, e.g., Zon (1988) Pharm. 
Res. 5:539-549). To this end, the oligonucleotide may be 
conjugated to another molecule, e.g., a peptide, 
hybridization triggered cross-linking agent, transport 
agent, hybridization-triggered cleavage agent, etc. 

TT. Isolated T139 Proteins and Anti-T139 Antibodies 

One aspect of the invention pertains to isolated 
T139 proteins, and biologically active portions thereof, 
as well as polypeptide fragments suitable for use as 
immunogens to raise anti-T13 9 antibodies. In one 
embodiment, native T139 proteins can be isolated from . 
cells or tissue sources by an appropriate purification 
scheme using standard protein purification techniques. 
In another embodiment, T139 proteins are produced by 
recombinant DNA techniques. Alternative to recombinant 
expression, a T139 protein or polypeptide can be 
0 synthesized chemically using standard peptide synthesis 
. techniques . 

An "isolated" or "purified" protein or 
biologically active portion thereof is substantially free 
of cellular material or other contaminating proteins from 
5 the cell or tissue source from which the T139 protein is 
derived, or substantially free from chemical precursors 
or other chemicals when chemically synthesized. The 
language "substantially free of cellular material" 
includes preparations of T139 protein in which the 
0 protein is separated from cellular components of the 
cells from which it is isolated or recombinantly 
produced. Thus, T139 protein that is substantially free 
of cellular material includes preparations of T13 9 
protein having less than about 30%, 20%, 10%, or 5% (by 
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dry weight) of non~T139 protein (also referred to herein 
as a "contaminating protein") - When the T13 9 protein or 
biologically active portion thereof is recombinantly 
produced, it is also preferably substantially free of 
5 culture medium, i.e., culture medium represents less than 
about 20%, 10%, or 5% of the volume of the protein 
preparation. When T139 protein is produced by chemical 
synthesis, it is preferably substantially free of 
chemical precursors or other chemicals, i.e., it is 

10 separated from chemical precursors or other chemicals 
which are involved in the synthesis of the protein. 
Accordingly such preparations of T13 9 protein have less 
than about 30%, 20%, 10%, 5% (by dry weight) of chemical 
precursors or non-T13 9 chemicals. 

15 Biologically active portions of a T139 protein 

include peptides comprising amino acid sequences 
sufficiently identical to or derived from the amino acid 
sequence of the T139 protein (e.g., the amino acid 
sequence shown in SEQ ID NO: 2 or SEQ ID NO: 4) , which 

2 0 include less amino acids than the full length T139 
proteins, and exhibit at least one activity of a T139 
protein. Typically, biologically active portions 
comprise a domain or motif with at least one activity of 
the T139 protein. A biologically active portion of a 

25 T139 protein can be a polypeptide which is, for example, 
10, 25, 50, 100 or more amino acids in length. Preferred 
biologically active polypeptides include one or more 
identified T139 structural domains, e.g., SCP-like domain 
(SEQ ID NO:5), C-type lectin domain (SEQ ID N0:6), or one 

30 of the EGF-like domains (SEQ ID N0:7 and 8). 

Moreover, other biologically active portions, in 
which other regions of the protein are deleted, can be 
prepared by recombinant techniques and evaluated for one 
or more of the functional activities of a native T139 

35 protein- Preferred T139 protein has the amino acid 
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sequence shown of SEQ ID NO:2. Other useful T139 
proteins are substantially identical to SEQ ID NO: 2 and 
retain the functional activity of the protein of SEQ ID 
NO: 2 yet differ in amino acid sequence due to natural 
5 allelic variation or mutagenesis. Accordingly, a useful 
T139 protein is a protein which includes an amino acid 
sequence at least about 45%, preferably 55%, 65%, 75%, 
85%, 95%, or 99% identical to the amino acid sequence of 
SEQ ID NO: 2 and retains the functional activity of the 

10 T139 proteins of SEQ ID NO:2. In other instances, the 
T139 protein is a protein having an amino acid ^sequence 
55%, 65%, 75%, 85%, 95%, or 98% identical to the SCP- 
like, C-type lectin, or EGF-like domain (any one of SEQ 
ID NOS:5-8). In a preferred embodiment, the T139 protein 

15 retains the functional activity of the T139 protein of 
SEQ ID NO:2. 

To determine the percent identity of two amino 
acid sequences or of two nucleic acids, the sequences are 
aligned for optimal comparison purposes (e.g., gaps can 

20 be introduced in the sequence of a first amino acid or 
nucleic acid sequence for optimal alignment with a second 
amino or nucleic acid sequence) . The amino acid residues 
or nucleotides at corresponding amino acid positions or 
nucleotide positions are then compared. When a position 

25 in the first sequence is occupied by the same amino acid 
residue or nucleotide as the corresponding position in 
the second sequence, then the molecules are identical at 
that position. The percent identity between the two 
sequences is a function of the number of identical 

30 positions shared by the sequences (i.e., % identity = # 
of identical positions/total # of positions x 100) . 

The determination of percent homology between two 
sequences can be accomplished using a mathematical 
algorithm. A preferred, non- limiting example of a 

35 mathematical algorithm utilized for the comparison of two 
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sequences is the algorithm of Karlin and Altschul (1990) 
Proc. Natl. Acad. Sci. USA 87:2264-2268, modified as in 
Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 
90:5873-5877. Such an algorithm is incorporated into the 
5 NBLAST and XBLAST programs of Altschul et al . (1990) J*. 
Mol. Biol. 215:403-410. BLAST nucleotide searches can be 
performed with the NBLAST program, score = 100, 
wordlength = 12 to obtain nucleotide sequences homologous 
to T139 nucleic acid molecules of the invention. BLAST 

10 protein searches can be performed with the XBLAST 

program, score = 50, wordlength = 3 to obtain amino acid 
sequences homologous to T139 protein molecules of the 
invention. To obtain gapped alignments for comparison 
purposes, Gapped BLAST can be utilized as described in 

15 Altschul et al. (1997) Nucleic Acids Res. 25:3389-3402. 
When utilizing BLAST and Gapped BLAST programs, the 
default parameters of the respective programs (e.g., 
XBLAST and NBLAST) can be used. See 

http://www.ncbi.nlm.nih.gov. Another preferred, non- 
20 limiting example of a mathematical algorithm utilized for 
the comparison of sequences is the algorithm of Myers and 
Miller, CABIOS (1989) . Such an algorithm is incorporated 
into the ALIGN program (version 2.0) which is part of the 
GCG sequence alignment software package. When utilizing 
25 the ALIGN program for comparing amino acid sequences, a 
PAM120 weight residue table, a gap length penalty of 12, 
and a gap penalty of 4 can be used. 

The percent identity between two sequences can be 
determined using techniques similar to those described 
30 above, with or without allowing gaps. In calculating 
percent identity, only exact matches are counted. 

The invention also provides T139. chimeric or 
fusion proteins. As used herein, a T139 "chimeric 
protein" or "fusion protein" comprises a T139 polypeptide 
35 operatively linked to a non-T139 polypeptide. A "T139 
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polypeptide" refers to a polypeptide having an amino acid 
sequence corresponding to T139, whereas a "non-T13 9 
polypeptide" refers to a polypeptide having an amino acid 
sequence corresponding to a protein which is not 
5 substantially identical to the T139 protein, e.g., a 
protein which is different from the T139 protein and 
which is derived from the same or a different organism. 
Within a T139 fusion protein the T139 polypeptide can 
correspond to all or a portion of a T139 protein, 

10 preferably at least one biologically active portion of a 
T139 protein. Within the fusion protein, the term 
"operatively linked" is intended to indicate that the 
T139 polypeptide and the non-T13 9 polypeptide are fused 
in- frame to each other. The non-T13 9 polypeptide can be 

15 fused to the N-terminus or C-terminus of the T139 
polypeptide. 

One useful fusion protein is a GST-T139 fusion 
protein in which the T139 sequences are fused to the C- 
terminus of the GST sequences. Such fusion proteins can 

20 facilitate the purification of recombinant T139. 

In another embodiment, the fusion protein is a 
T139 protein containing a heterologous signal sequence at 
its N-terminus. For example, the native T139 signal 
sequence (i.e., about amino acids 1 to 22 of SEQ ID NO:2) 

25 can be removed and replaced with a signal sequence from 
another protein. In certain host cells (e.g., mammalian 
host cells) , expression and/or secretion of T13 9 can be 
increased through use of a heterologous signal sequence. 
For example, the gp67 secretory sequence of the 

30 baculovirus envelope protein can be used as a 

heterologous signal sequence (Current Protocols in 
Molecular Biology , Ausubel et al., eds., John Wiley & 
Sons, 1992) . Other examples of eukaryotic heterologous 
signal sequences include the secretory sequences of 

35 melittin and human placental alkaline phosphatase 
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(Stratagene; La Jolla, California) . In yet another 
example, useful prokaryotic heterologous signal sequences 
include the phoA secretory signal {Molecular cloning, 
Sambrook et al, second edition, Cold spring harbor 
5 laboratory press, 1989) and the protein A secretory 
signal (Pharmacia Biotech; Piscataway, New Jersey) . 

In yet another embodiment, the fusion protein is 
an T13 9 -immunoglobulin fusion protein in which all or 
part of T139 is fused to sequences derived from a member 

10 of the immunoglobulin protein family. (see, e.g., PCT 
Publication Number WO 88/07087 and Aruf f o~efc- al . (1990) 
Cell 61:1303-1313). The T139-iirammoglobulin fusion 
proteins of the invention can be incorporated into 
pharmaceutical compositions and administered to a subject 

15 to inhibit an interaction between a T139 receptor and a 
T139 protein on the surface of a cell, to thereby 
suppress T13 9 -mediated signal transduction in vivo. 
Inhibition of the T139 ligand/T139 interaction may be 
useful therapeutically for both the treatment of 

20 proliferative and different iative disorders, as well as 
modulating (e.g. promoting or inhibiting) cell survival. 
Moreover, the T13 9 -immunoglobulin fusion proteins of the 
invention can be used as immunogens to produce anti-T139 
antibodies in a subject, to purify T139 receptors and in 

25 screening assays to identify molecules which inhibit the 
interaction of T139 with its receptor. 

Preferably, a T139 chimeric or fusion protein of 
the invention is produced by standard recombinant DNA 
techniques. For example, DNA fragments coding for the 

30 different polypeptide sequences are ligated together in- 
frame in accordance with conventional techniques, for 
example by employing blunt-ended or stagger-ended termini 
for ligation, restriction enzyme digestion to provide for 
appropriate termini, filling-in of cohesive ends as 

35 appropriate, alkaline phosphatase treatment to avoid 



WO 99/54343 



PCT/US99/08896 



- 32 - 

undesirable joining, and enzymatic ligation. In another 
embodiment, the fusion gene can be synthesized by 
conventional techniques including automated DNA 
synthesizers. Alternatively, PCR amplification of gene 
5 fragments can be carried out using anchor primers which 
give rise to complementary overhangs between two 
consecutive gene fragments which can subsequently be 
annealed and reamplified to generate a chimeric gene 
sequence (see, e.g., Current Protocols in Molecular 

10 Biology, Ausubel et al. eds., John Wiley & Sons: 1992). 
Moreover, many expression vectors are commercially 
available that already encode a fusion moiety (e.g., a 
GST polypeptide) . An T13 9 -encoding nucleic acid can be 
cloned into such an expression vector such that the 

15 fusion moiety is linked in-frame to the T139 protein. 

The present invention also pertains to variants of 
the T139 proteins which function as either T139 agonists 
(mimetics) or as T139 antagonists. Variants of the T13 9 
protein can be generated by mutagenesis, e.g., discrete 

20 point mutation or truncation of the T139 protein. An 

agonist of the T13 9 protein can retain substantially the 
same, or a subset, of the biological activities of the 
naturally occurring form of the T139 protein. An 
antagonist of the T139 protein can inhibit one or more of 

25 the activities of the naturally occurring form of the 

T139 protein by, for example, competitively binding to a 
downstream or upstream member of a cellular signaling 
cascade which includes the T139 protein. Thus, specific 
biological effects can be elicited by treatment with a 

30 variant of limited function. Treatment of a subject with 
a variant having a subset of the biological activities of 
the naturally occurring form of the protein can have 
fewer side effects in a subject relative to treatment 
with the naturally occurring form of the T139 proteins. 
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Variants of the T139 protein which function as 
either T139 agonists (mimetics) or as T139 antagonists 
can be identified by screening combinatorial libraries of 
mutants, e.g., truncation mutants, of the T13 9 protein 
5 for T13 9 protein agonist or antagonist activity. In one 
embodiment, a variegated library of T13 9 variants is 
generated by combinatorial mutagenesis at the nucleic 
acid level and is encoded by a variegated gene library. 
A variegated library of T13 9 variants can be produced by, 

10 for example, enzymatically ligating a mixture of 

synthetic oligonucleotides into gene sequences such that 
a degenerate set of potential T139 sequences is 
expressible as individual polypeptides, or alternatively, 
as a set of larger fusion proteins (e.g., for phage 

15 display) containing the set of T139 sequences therein. 
There are a variety of methods which can be used to 
produce libraries of potential T139 variants from a 
degenerate oligonucleotide sequence. Chemical synthesis 
of a degenerate gene sequence can be performed in an 

20 automatic DNA synthesizer, and the synthetic gene then 
ligated into an appropriate expression vector. Use of a 
degenerate set of genes allows for the provision, in one 
mixture, of all of the sequences encoding the desired set 
of potential T139 sequences. Methods for synthesizing 

25 degenerate oligonucleotides are known in the art (see, 
e.g., Narang (1983) Tetrahedron 39:3; Itakura et al. 
(1984) Annu. Rev. Biochem. 53:323; Itakura et al . (1984) 
Science 198:1056; Ike et al . (1983) Nucleic Acid Res. 
11:477). 

30 In addition, libraries of fragments of the T139 

protein coding sequence can be used to generate a 
variegated population of T139 fragments for screening and 
subsequent selection of variants of a T13 9 protein. In 
one embodiment, a library of coding sequence fragments 

35 can be generated by treating a double stranded PCR 
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fragment of a T139 coding sequence with a nuclease under 
conditions wherein nicking occurs only about once per 
molecule, denaturing the double stranded DNA, renaturing 
the DNA to form double stranded DNA which can include 
5 sense/antisense pairs from different nicked products, 
removing single stranded portions from reformed duplexes 
by treatment with SI nuclease, and ligating the resulting 
fragment library into an expression vector. By this 
method, an expression library can be derived which 

10 encodes N-terminal and internal fragments of various 
sizes of the T139 protein. — - 

Several techniques are known in the art for 
screening gene products of combinatorial libraries made 
by point mutations or truncation, and for screening cDNA 

15 libraries for gene products having a selected property. 
Such techniques are adaptable for rapid screening of the 
gene libraries generated by the combinatorial mutagenesis 
of T139 proteins. The most widely used techniques, which 
are amenable to high through-put analysis, for screening 

20 large gene libraries typically include cloning the gene 
library into replicable expression vectors, transforming 
appropriate cells with the resulting library of vectors, 
and expressing the combinatorial genes under conditions 
in which detection of a desired activity facilitates 

25 isolation of the vector encoding the gene whose product 
was detected. Recursive ensemble mutagenesis (REM) , a 
technique which enhances the frequency of functional 
mutants in the libraries, can be used in combination with 
the screening assays to identify T139 variants (Arkin and 

30 Yourvan (1992) Proc. Natl. Acad. Sci. USA 89:7811-7815; 
Delgrave et al. (1993) Protein Engineering 6(3) : 327-331) . 

An isolated T13 9 protein, or a portion or fragment 
thereof, can be used as an immunogen to generate 
antibodies that bind T13 9 using standard techniques for 

35 polyclonal and monoclonal antibody preparation. The 
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full-length T139 protein can be used or, alternatively, 
the invention provides antigenic peptide fragments of 
T139 for use as immunogens. The antigenic peptide of 
T139 comprises at least 8 (preferably 10, 15, 20, or 30) 
5 amino acid residues of the amino acid sequence shown in 
SEQ ID NO: 2 and encompasses an epitope of T139 such that 
an antibody raised against the peptide forms a specific 
immune complex with T139. 

Preferred epitopes encompassed by the antigenic 

10 peptide are regions of T139 that are located on the 
surface of the protein, e.g., hydrophilic- regions. A 
hydrophobicity analysis of the human T139 protein 
sequence indicates that the regions between, e.g., amino 
acids 125 and 140, between amino acids 265 and 280, and 

15 between amino acids 315 and 335 of SEQ ID NO: 2 are 

particularly hydrophilic and, therefore, are likely to 
encode surface residues useful for targeting antibody 
production. 

A T139 immunogen typically is used to prepare 

20 antibodies by immunizing a suitable subject, (e.g., 

rabbit, goat, mouse or other mammal) with the immunogen. 
An appropriate immunogenic preparation can contain, for 
example, recombinantly expressed T139 protein or a 
chemically synthesized T139 polypeptide. The preparation 

25 can further include an adjuvant, such as Freund' s 
complete or incomplete adjuvant, or similar 
immunostimulatory agent. Immunization of a suitable 
subject with an immunogenic T13 9 preparation induces a 
polyclonal anti-T139 antibody response. 

3 0 Accordingly, another aspect of the invention 

pertains to anti-T139 antibodies. The term "antibody" as 
used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin 
molecules, i.e., molecules that contain an antigen 

35 binding site which specifically binds an antigen, such as 
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T139. A molecule which specifically binds to T139 is a 
molecule which binds T139, but does not substantially 
bind other molecules in a sample, e.g., a biological 
sample, which naturally contains T139. Examples of 
5 immunologically active portions of immunoglobulin 

molecules include F(ab) and F(ab') 2 fragments which can be 
generated by treating the antibody with an enzyme such as 
pepsin. The invention provides polyclonal and monoclonal 
antibodies that bind T139. The term "monoclonal 

10 antibody" or "monoclonal antibody composition", as used 
herein, refers to a population of antibody molecules that 
contain only one species of an antigen binding site 
capable of immunoreacting with a particular epitope of 
T13 9. A monoclonal antibody composition thus typically 

15 displays a single binding affinity for a particular T139 
protein with which it immunoreacts . 

Polyclonal anti-T139 antibodies can be prepared as 
described above by immunizing a suitable subject with a 
T139 immunogen. The anti-T139 antibody titer in the 

20 immunized subject can be monitored over time by standard 
techniques, such as with an enzyme linked immunosorbent 
assay (ELISA) using immobilized T139. If desired, the 
antibody molecules directed against T139 can be isolated 
from the mammal (e.g., from the blood) and further 

25 purified by well-known techniques, such as protein A 
chromatography to obtain the IgG fraction. At an 
appropriate time after immunization, e.g., when the anti- 
T139 antibody titers are highest , antibody-producing 
cells can be obtained from the subject and used to 

30 prepare monoclonal antibodies by standard techniques, 
such as the hybridoma technique originally described by 
Kohler and Milstein (1975) Nature 256:495-497, the human 
B cell hybridoma technique (Kozbor et al . (1983) Immunol 
Today 4:72), the EBV-hybridoma technique (Cole et al . 

35 (1985) , Monoclonal Antibodies and Cancer Therapy, Alan R. 
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Liss, Inc., pp. 77-96) or trioma techniques. The 
technology for producing various antibodies monoclonal 
antibody hybridomas is well known (see, e.g., Current 
Protocols in Immunology (1994) Coligan et al . (eds.) John 
5 Wiley & Sons, Inc., New York, NY). Briefly, an immortal 
cell line (typically a myeloma) is fused to lymphocytes 
(typically splenocytes) from a mammal immunized with a 
T139 immunogen as described above, and the culture 
supernatants of the resulting hybridoma cells are 

10 screened to identify a hybridoma producing a monoclonal 
antibody that binds T139. 

Any of the many well known protocols used for 
fusing lymphocytes and immortalized cell lines can be 
applied for the purpose of generating an anti-T139 

15 monoclonal antibody (see, e.g., Current Protocols in 

Immunology, supra; Galfre et al. (1977) Mature 266:55052; 
R.H. Kenneth, in Monoclonal Antibodies: A New Dimension 
In Biological Analyses, Plenum Publishing Corp., New 
York, New York (1980); and Lerner (1981) Yale J. Biol. 

20 Med., 54:387-402. Moreover, the ordinarily skilled 

worker will appreciate that there are many variations of 
such methods which also would be useful. Typically, the 
immortal cell line (e.g., a myeloma cell line) is derived 
from the same mammalian species as the lymphocytes. For 

25 example, murine hybridomas can be made by fusing 

lymphocytes from a mouse immunized with an immunogenic 
preparation of the present invention with an immortalized 
mouse cell line, e.g., a myeloma cell line that is 
sensitive to culture medium containing hypoxanthine, 

30 aminopterin and thymidine ("HAT medium"). Any of a 
number of myeloma cell lines can be used as a fusion 
partner according to standard techniques, e.g., the P3- 
NSl/l-Ag4-l, P3-x63-Ag8 .653 or Sp2/0-Agl4 myeloma lines. 
These myeloma lines are available from ATCC. Typically, 

35 HAT-sensitive mouse myeloma cells are fused to mouse 
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splenocytes using polyethylene glycol ("PEG") . Hybridoma 
cells resulting from the fusion are then selected using 
HAT medium, which kills unfused and unproductively fused 
myeloma cells (unfused splenocytes die after several days 
5 because they are not transformed) . Hybridoma cells 
producing a monoclonal antibody of the invention are 
detected by screening the hybridoma culture supernatants 
for antibodies that bind T139, e.g., using a standard 
ELISA assay. 

10 Alternative to preparing monoclonal antibody- 

secreting hybridomas, a monoclonal anti-T139 antibody can 
be identified and isolated by screening a recombinant 
combinatorial immunoglobulin library (e.g., an antibody 
phage display library) with T13 9 to thereby isolate 

15 immunoglobulin library members that bind T139. Kits for 
generating and screening phage display libraries are 
commercially available (e.g., the Pharmacia Recombinant 
Phage Antibody System, Catalog No. 27-9400-01; and the 
Stratagene Surf ZAP™ Phage Display Kit, Catalog No. 

2 0 24 0612) . Additionally, examples of methods and reagents 
particularly amenable for use in generating and screening 
antibody display library can be found in, for example, 
U.S. Patent No. 5,223,409; PCT Publication No. WO 
92/18619; PCT Publication No. WO 91/17271; PCT 

25 Publication WO 92/20791; PCT Publication No. WO 92/15679; 
PCT Publication WO 93/01288; PCT Publication No. WO 
92/01047; PCT Publication No. WO 92/09690; PCT 
Publication No. WO 90/02809; Fuchs et al . (1991) 
Bio/Technology 9:1370-1372; Hay et al. (1992) Hum. 

30 Antibod. Hybridomas 3:81-85; Huse et al. (1989) Science 
246:1275-1281; Griffiths et al. (1993) EMBO J 12:725-734, 

Additionally, recombinant anti-T139 antibodies, 
such as chimeric and humanized monoclonal antibodies, 
comprising both human and non-human portions, which can 

35 be made using standard recombinant DNA techniques, are 
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within the scope of the invention. Such chimeric and 
humanized monoclonal antibodies can be produced by 
recombinant DNA techniques known in the art, for example 
using methods described in PCT Publication No. WO 
5 87/02671; European Patent Application 184,187; European 
Patent Application 171,496; European Patent Application 
173,494; PCT Publication No. WO 86/01533; U.S. Patent No. 
4,816,567; European Patent Application 125,023; Better et 
al. (1988) Science 240:1041-1043; Liu et al. (1987) Proc. 

10 Natl. Acad. Sci . USA 84:3439-3443; Liu et al . (1987) J* . 
Immunol. 139:3521-3526; Sun et al . (1987) Proc. Natl. 
Acad. Sci. USA 84:214-218; Nishimura et al . (1987) Cane. 
Res. 47:999-1005; Wood et al . (1985) Nature 314:446-449; 
and Shaw et al. (1988) J. Natl. Cancer Inst. 80:1553- 

15 1559); Morrison, (1985) Science 229:1202-1207; Oi et al . 
(1986) Bio/Techniques 4:214; U.S. Patent 5,225,539; Jones 
et al. (1986) Nature 321:552-525; Verhoeyan et al. (1988) 
Science 239:1534; and Beidler et al. (1988) J. Immunol. 
141:4053-4060. 

20 An anti-T139 antibody (e.g., monoclonal antibody) 

can be used to isolate T13 9 by standard techniques, such 
as affinity chromatography or immunoprecipitation. An 
anti-T139 antibody can facilitate the purification of 
natural T139 from cells and of recombinantly produced 

25 T139 expressed in host cells. Moreover, an anti-T139 
antibody can be used to detect T139 protein (e.g., in a 
cellular lysate or cell supernatant) in order to evaluate 
the abundance and pattern of expression of the T139 
protein. Anti-T139 antibodies can be used diagnostically 

30 to monitor protein levels in tissue as part of a clinical 
testing procedure, e.g., to, for example, determine the 
efficacy of a given treatment regimen. Detection can be 
facilitated by coupling the antibody to a detectable 
substance. Examples of detectable substances include 

35 various enzymes, prosthetic groups, fluorescent 
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materials, luminescent materials, bioluminescent 
materials, and radioactive materials. Examples of 
suitable enzymes include horseradish peroxidase, alkaline 
phosphatase, j3-galactosidase, or acetylcholinesterase; 
5 examples of suitable prosthetic group complexes include 
streptavidin/biotin and avidin/biotin; examples of 
suitable fluorescent materials include umbellif erone, 
fluorescein, fluorescein isothiocyanate, rhodamine, 
dichlorotriazinylamine fluorescein, dansyl chloride or 
10 phycoerythrin; an example of a luminescent material 

includes luminol; examples of bioluminescent materials 
include lucif erase, luciferin, and aequorin, and examples 
of suitable radioactive material include 125 I, 131 I, 35 S or 

15 III. Recombinant Expression Vectors and Host Cells 
Another aspect of the invention pertains to 
vectors, preferably expression vectors, containing a 
nucleic acid encoding T139 (or a portion thereof) . As 
used herein, the term "vector" refers to a nucleic acid 

20 molecule capable of transporting another nucleic acid to 
which it has been linked. One type of vector is a 
"plasmid", which refers to a circular double stranded DNA 
loop into which additional DNA segments can be ligated. 
Another type of vector is a viral vector, wherein 

25 additional DNA segments can be ligated into the viral 
genome. Certain vectors are capable of autonomous 
replication in a host cell into which they are introduced 
(e.g., bacterial vectors having a bacterial origin of 
replication and episomal mammalian vectors) . Other 

30 vectors (e.g., non-episomal mammalian vectors) are 
integrated into the genome of a host cell upon 
introduction into the host cell, and thereby are 
replicated along with the host genome. Moreover, certain 
vectors, expression vectors, are capable of directing the 
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expression of genes to which they are operatively linked. 
In general, expression vectors of utility in recombinant 
DNA techniques are often in the form of plasmids 
(vectors) . However, the invention is intended to include 
5 such other forms of expression vectors, such as viral 
vectors (e.g., replication defective retroviruses, 
adenoviruses and adeno-associated viruses) , which serve 
equivalent functions. 

The recombinant expression vectors of the 

10 invention comprise a nucleic acid of the invention in a 
form suitable for expression of the nucleic acid in a 
host cell, which means that the recombinant expression 
vectors include one or more regulatory sequences, 
selected on the basis of the host cells to be used for 

15 expression, which is operatively linked to the nucleic 
acid sequence to be expressed. Within a recombinant 
expression vector, "operably linked" is intended to mean 
that the nucleotide sequence of interest is linked to the 
regulatory sequence (s) in a manner which allows for 

20 expression of the nucleotide sequence (e.g., in an in 

vitro transcription/translation system or in a host cell 
when the vector is introduced into the host cell) . The 
term "regulatory sequence" is intended to include 
promoters, enhancers and other expression control 

25 elements (e.g., polyadenylation signals). Such 

regulatory sequences are described, for example, in 
Goeddel; Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, CA (1990) . 
Regulatory sequences include those which direct 

30 constitutive expression of a nucleotide sequence in many 
types of host cell and those which direct expression of 
the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sequences) . It will be 
appreciated by those skilled in the art that the design 

35 of the expression vector can depend on such factors as 
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the choice of the host cell to be transformed, the level 
of expression of protein desired, etc. The expression 
vectors of the invention can be introduced into host 
cells to thereby produce proteins or peptides, including 
5 fusion proteins or peptides, encoded by nucleic acids as 
described herein (e.g., T139 proteins, mutant forms of 
T139, fusion proteins, etc.). 

The recombinant expression vectors of the 
invention can be designed for expression of T139 in 

10 prokaryotic or eukaryotic cells, e.g., bacterial cells 
such as E. coli, insect cells (using baculovirus 
expression vectors) yeast cells or mammalian cells. 
Suitable host cells are discussed further in Goeddel, 
Gene Expression Technology: Methods in Enzymology 185, 

15 Academic Press, San Diego, CA (1990) . Alternatively, the 
recombinant expression vector can be transcribed and 
translated in vitro, for example using T7 promoter 
regulatory sequences and T7 polymerase. 

Expression of proteins in prokaryotes is most 

20 often carried out in E. coli with vectors containing 
constitutive or inducible promoters directing the 
expression of either fusion or non-fusion proteins. 
Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the 

25 recombinant protein. Such fusion vectors typically serve 
three purposes: 1) to increase expression of recombinant 
protein; 2) to increase the solubility of the recombinant 
protein; and 3) to aid in the purification of the 
recombinant protein by acting as a ligand in affinity 

30 purification. Often, in fusion expression vectors, a 
proteolytic cleavage site is introduced at the junction 
of the fusion moiety and the recombinant protein to 
enable separation of the recombinant protein from the 
fusion moiety subsequent to purification of the fusion 

35 protein. Such enzymes, and their cognate recognition 
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sequences, include Factor Xa, thrombin and enterokinase . 
Typical fusion expression vectors include pGEX (Pharmacia 
Biotech Inc; Smith and Johnson (1988) Gene 67:31-40), 
pMAL (New England Biolabs, Beverly, MA) and pRIT5 
5 (Pharmacia, Piscataway, NJ) which fuse glutathione S- 
transf erase (GST) , maltose E binding protein, or protein 
A, respectively, to the target recombinant protein. 

Examples of suitable inducible non- fusion E. coli 
expression vectors include pTrc (Amann et al . , (1988) 

10 Gene 69:301-315) and pET lid (Studier et al., Gene 
Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, California (1990) 60-89) . 
Target gene expression from the pTrc vector relies on 
host RNA polymerase transcription from a hybrid trp-lac 

15 fusion promoter. Target gene expression from the pET lid 
vector relies on transcription from a T7 gnlO-lac fusion 
promoter mediated by a coexpressed viral RNA polymerase 
(T7 gnl) . This viral polymerase is supplied by host 
strains BL21(DE3) or HMS174(DE3) from a resident X 

20 prophage harboring a T7 gnl gene under the 

transcriptional control of the lacUV 5 promoter. 

One strategy to maximize recombinant protein 
expression in E. coli is to express the protein in a host 
bacteria with an impaired capacity to proteolytically 

25 cleave the recombinant protein (Gottesman, Gene 
Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, California (1990) 119-128) . 
Another strategy is to alter the nucleic acid sequence of 
the nucleic acid to be inserted into an expression vector 

30 so that the individual codons for each amino acid are 
those preferentially utilized in E. coli (Wada et al . 
(1992) Nucleic Acids Res. 20:2111-2118). Such alteration 
of nucleic acid sequences of the invention can be carried 
out by standard DNA synthesis techniques. 
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In another embodiment, the T13 9 expression vector 
is a yeast expression vector. Examples of vectors for 
expression in yeast S. cerivisae include pYepSecl 
(Baldari et al. (1987) EMBO J. 6:229-234), pMFa (Kurjan 
5 and Herskowitz, (1982) Cell 30 : 933-943) , pJRY88 (Schultz 
et al- (1987) Gene 54:113-123), pYES2 (Invitrogen 
Corporation, San Diego, CA) , and picZ (InVitrogen Corp, 
San Diego, CA) . 

Alternatively, T139 can be expressed in insect 

10 cells using baculovirus expression vectors. Baculovirus 
vectors available for expression of proteins in cultured 
insect cells (e.g., Sf 9 cells) include the pAc series 
(Smith et al . (1983) Afol . Cell Biol. 3:2156-2165) and the 
pVL series (Lucklow and Summers (1989) Virology 170:31- 

15 39) . 

In yet another embodiment, a nucleic acid of the 
invention is expressed in mammalian cells using a 
mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed (1987) Nature 

20 329:840) and pMT2PC (Kaufman et al . (1987) EMBO J. 6:187- 
195) . When used in mammalian cells, the expression 
vector's control functions are often provided by viral 
regulatory elements. For example, commonly used 
promoters are derived from polyoma, Adenovirus 2, 

25 cytomegalovirus and Simian Virus 40. For other suitable 
expression systems for both prokaryotic and eukaryotic 
cells see chapters 16 and 17 of Sambrook et al . (supra). 

In another embodiment, the recombinant mammalian 
expression vector is capable of directing expression of 

3 0 the nucleic acid preferentially in a particular cell type 
(e.g., tissue-specific regulatory elements are used to 
express the nucleic acid) . Tissue-specific regulatory 
elements are known in the art. Non-limiting examples of 
suitable tissue-specific promoters include the albumin 

35 promoter (liver-specific; Pinkert et al. (1987) Genes 
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Dev. 1:268-277), lymphoid- specific promoters (Calame and 
Eaton (1988) Adv. Immunol. 43:235-275), in particular 
promoters of T cell receptors (Winoto and Baltimore 
(1989) EMBO J". 8:729-733) and immunoglobulins (Banerji et 
5 al. (1983) Cell 33:729-740; Queen and Baltimore (1983) 
Cell 33 : 741-748) , neuron- specific promoters (e.g., the 
neurofilament promoter; Byrne and Ruddle (1989) Proc. 
Natl. Acad. Sci . USA 86:5473-5477), pancreas -specific 
promoters (Edlund et al . (1985) Science 230:912-916), and 

10 mammary gland-specific promoters (e.g., milk whey 
promoter; U.S. Patent No. 4,873,346 and European 
Application Publication No. 264,166). Developmental ly- 
regulated promoters are also encompassed, for example the 
murine hox promoters (Kessel and Gruss (1990) Science 

15 249:374-379) and the or- fetoprotein promoter (Campes and 
Tilghman (1989) Genes Dev. 3:537-546). 

The invention further provides a recombinant 
expression vector comprising a DNA molecule of the 
invention cloned into the expression vector in an 

20 antisense orientation. That is, the DNA molecule is 

operatively linked to a regulatory sequence in a manner 
which allows for expression (by transcription of the DNA 
molecule) of an RNA molecule which is antisense to T139 
mRNA. Regulatory sequences operatively linked to a 

25 nucleic acid cloned in the antisense orientation can be 
chosen which direct the continuous expression of the 
antisense RNA molecule in a variety of cell types, for 
instance viral promoters and/or enhancers, or regulatory 
sequences can be chosen which direct constitutive, tissue 

30 specific or cell type specific expression of antisense 
RNA. The antisense expression vector can be in the form 
of a recombinant plasmid, phagemid or attenuated virus in 
which antisense nucleic acids are produced under the 
control of a high efficiency regulatory region, the 

35 activity of which can be determined by the cell type into 
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which the vector is introduced. For a discussion of the 
regulation of gene expression using antisense genes See 
Weintraub et al. f Reviews - Trends in Genetics, Vol. 1(1) 
1986. 

5 Another aspect of the invention pertains to host 

cells into which a recombinant expression vector of the 
invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. 
It is understood that such terms refer not only to the 

10 particular subject cell but to the progeny or potential 
progeny of such a cell. Because, certain modifications 
may occur in succeeding generations due to either 
mutation or environmental influences, such progeny may 
not, in fact, be identical to the parent cell, but are 

15 still included within the scope of the term as used 
herein. 

A host cell can be any prokaryotic or eukaryotic 
cell. For example, T139 protein can be expressed in 
bacterial cells such as E. coli, insect cells, yeast or 

20 mammalian cells (such as Chinese hamster ovary cells 

(CH0) or COS cells) . Other suitable host cells are known 
to those skilled in the art. 

Vector DNA can be introduced into prokaryotic or 
eukaryotic cells via conventional transformation or 

25 transfection techniques. As used herein, the terms 

11 tranjs formation 11 and "transfection" are intended to refer 
to a variety of art -recognized techniques for introducing 
foreign nucleic acid (e.g., DNA) into a host cell, 
including calcium phosphate or calcium chloride co- 

30 precipitation, DEAE-dextran-mediated transfection, 

lipofection, or electroporation. Suitable methods for 
transforming or transfecting host cells can be found in 
Sambrook, et al. (supra), and other laboratory manuals. 

For stable transfection of mammalian cells, it is 

35 known that, depending upon the expression vector and 
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transf ection technique used, only a small fraction of 
cells may integrate the foreign DNA into their genome. 
In order to identify and select these integrants, a gene 
that encodes a selectable marker (e.g., resistance to 
5 antibiotics) is generally introduced into the host cells 
along with the gene of interest. Preferred selectable 
markers include those which confer resistance to drugs, 
such as G418, hygromycin and methotrexate. Nucleic acid 
encoding a selectable marker can be introduced into a 

10 host cell on the same vector as that encoding T139 or can 
be introduced on a separate vector. Cells stably 
transfected with the introduced nucleic acid can be 
identified by drug selection (e.g., cells that have 
incorporated the selectable marker gene will survive, 

15 while the other cells die) . 

A host cell of the invention, such as a 
prokaryotic or eukaryotic host cell in culture, can be 
used to produce (i.e., express) T139 protein. 
Accordingly, the invention further provides methods for 

20 producing T139 protein using the host cells of the 
invention. In one embodiment, the method comprises 
culturing the host cell of invention (into which a 
recombinant expression vector encoding T139 has been 
introduced) in a suitable medium such that T139 protein 

25 is produced. In another embodiment, the method further 
comprises isolating T13 9 from the medium or the host 
cell . 

The host cells of the invention can also be used 
to produce nonhuman transgenic animals. For example, in 

30 one embodiment, a host cell of the invention is a 

fertilized oocyte or an embryonic stem cell into which 
T139-coding sequences have been introduced. Such host 
cells can then be used to create non-human transgenic 
animals in which exogenous T13 9 sequences have been 

35 introduced into their genome or homologous recombinant 
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animals in which endogenous T13 9 sequences have been 
altered. Such animals are useful for studying the . 
function and/or activity of T13 9 and for identifying 
and/or evaluating modulators of T13 9 activity. As used 
5 herein, a "transgenic animal" is a non-human animal, 
preferably a mammal, more preferably a rodent such as a 
rat or mouse, in which one or more of the cells of the 
animal includes a transgene. Other examples of 
transgenic animals include non-human primates, sheep, 

10 dogs, cows, goats, chickens, amphibians, etc. A 

transgene is exogenous DNA which is integrated into the 
genome of a cell from which a transgenic animal develops 
and which remains in the genome of the mature animal , 
thereby directing the expression of an encoded gene 

15 product in one or more cell types or tissues of the 
transgenic animal. As used herein, an "homologous 
recombinant animal" is a non-human animal, preferably a 
mammal, more preferably a mouse, in which an endogenous 
T139 gene has been altered by homologous recombination 

20 between the endogenous gene and an exogenous DNA molecule 
introduced into a cell of the animal, e.g., an embryonic 
cell of the animal, prior to development of the animal. 

A transgenic animal of the invention can be 
created by introducing T13 9 -encoding nucleic acid into 

25 the male pronuclei of a fertilized oocyte, e.g., by 
microinjection, retroviral infection, and allowing the 
oocyte to develop in a pseudopregnant female foster 
animal. The T139 cDNA sequence e.g., that of (SEQ ID 
NO:l, SEQ ID NO:3, or the cDNA of ATCC 98694) can be 

30 introduced as a transgene into the genome of a non-human 
animal. Alternatively, a nonhuman homologue of the human 
T13 9 gene, such as a mouse T13 9 gene, can be isolated 
based on hybridization to the human T139 cDNA and used as 
a transgene. Intronic sequences and polyadenylation 

35 signals can also be included in the transgene to increase 
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the efficiency of expression of the transgene. A tissue- 
specific regulatory sequence (s) can be operably linked to 
the T139 transgene to direct expression of T139 protein 
to particular cells. Methods for generating transgenic 
5 animals via embryo manipulation and microinjection, 
particularly animals such as mice, have become 
conventional in the art and are described, for example, 
in U.S. Patent Nos . 4,736,866 and 4,870,009, U.S. Patent 
No. 4,873,191 and in Hogan, Manipulating the Mouse 

10 Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, N.Y. , 1986). Similar methods are used for 
production of other transgenic animals. A transgenic 
founder animal can be identified based upon the presence 
of the T139 transgene in its genome and/ or expression of 

15 T139 mRNA in tissues or cells of the animals. A 

transgenic founder animal can then be used to breed 
additional animals carrying the transgene. Moreover, 
transgenic animals carrying a transgene encoding T139 can 
further be bred to other transgenic animals carrying 

20 other transgenes. 

To create an homologous recombinant animal , a 
vector is prepared which contains at least a portion of a 
T13 9 gene (e.g., a human or a non-human homolog of the 
T139 gene, e.g., a murine T139 gene) into which a 

25 deletion, addition or substitution has been introduced to 
thereby alter, e.g., functionally disrupt, the T139 gene. 
In a preferred embodiment, the vector is designed such 
that, upon homologous recombination, the endogenous T139 
gene is functionally disrupted (i.e., no longer encodes a 

30 functional protein; also referred to as a "knock out" 

vector) . Alternatively, the vector can be designed such 
that, upon homologous recombination, the endogenous T13 9 
gene is mutated or otherwise altered but still encodes 
functional protein (e.g., the upstream regulatory region 

35 can be altered to thereby alter the expression of the 
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endogenous T139 protein) . In the homologous 
recombination vector, the altered portion of the T139 
gene is flanked at its 5' and 3' ends by additional 
nucleic acid of the T139 gene to allow for homologous 
5 recombination to occur between the exogenous T13 9 gene 
carried by the vector and an endogenous T139 gene in an 
embryonic stem cell. The additional flanking T13 9 
nucleic acid is of sufficient length for successful 
homologous recombination with the endogenous gene. 

10 Typically, several kilobases of flanking DNA (both at the 
5' and 3' ends) are included in the vector (see e.g., 
Thomas and Capecchi (1987) Cell 51:503 for a description 
of homologous recombination vectors) . The vector is 
introduced into an embryonic stem cell line (e.g., by 

15 electroporation) and cells in which the introduced T139 
gene has homologously recombined with the endogenous T139 
gene are selected (see e.g., Li et al . (1992) Cell 
69:915) . The selected cells are then injected into a 
blastocyst of an animal (e.g., a mouse) to form 

20 aggregation chimeras (see, e.g., Bradley in 

Teratocarcinomas and Embryonic Stem Cells: A Practical 
Approach, Robertson, ed. (IRL, Oxford, 1987) pp. 113- 
152) . A chimeric embryo can then be implanted into a 
suitable pseudopregnant female foster animal and the 

25 embryo brought to term. Progeny harboring the 

homologously recombined DNA in their germ cells can be 
used to breed animals in which all cells of the animal 
contain the homologously recombined DNA by germline 
transmission of the transgene. Methods for constructing 

30 homologous recombination vectors and homologous 

recombinant animals are described further in Bradley 
(1991) Current Opinion in Bio/Technology 2:823-829 and in 
PCT Publication Nos. WO 90/11354, WO 91/01140, WO 
92/0968, and WO 93/04169. 
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In another embodiment, transgenic non-humans 
animals can be produced which contain selected systems 
which allow for regulated expression of the transgene. 
One example of such a system is the cre/loxP recombinase 
5 system of bacteriophage PI. For a description of the 
cre/loxP recombinase system, see, e.g., Lakso et al . 
(1992) Proc. Natl. Acad. Sci. USA 89:6232-6236. Another 
example of a recombinase system is the FLP recombinase 
system of Saccharomyces cerevisiae (O' Gorman et al. 

10 (1991) Science 251:1351-1355. If a cre/loxP recombinase 
system is used to regulate expression of the transgene, 
animals containing transgenes encoding both the Cre 
recombinase and a selected protein are required. Such 
animals can be provided through the construction of 

15 "double" transgenic animals, e.g., by mating two 

transgenic animals, one containing a transgene encoding a 
selected protein and the other containing a transgene 
encoding a recombinase. 

Clones of the non-human transgenic animals 

20 described herein can also be produced according to the 

methods described in Wilmut et al . (1997) Nature 385 : 810- 
813 and PCT Publication Nos. WO 97/07668 and WO 97/07669. 
In brief, a cell, e.g., a somatic cell, from the 
transgenic animal can be isolated and induced to exit the 

25 growth cycle and enter G 0 phase. The quiescent cell can 
then be fused, e.g., through the use of electrical 
pulses, to an enucleated oocyte from an animal of the 
same species from which the quiescent cell is isolated. 
The reconstructed oocyte is then cultured such that it 

30 develops to morula or blastocyte and then transferred to 
pseudopregnant female foster animal . The offspring borne 
of this female foster animal will be a clone of the 
animal from which the cell, e.g., the somatic cell, is 
isolated. 
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TV. Pharmaceutical Compositions 

The T139 nucleic acid molecules, T13 9 proteins, 
and anti-T139 antibodies (also referred to herein as 
"active compounds") of the invention can be incorporated 
5 into pharmaceutical compositions suitable for 

administration. Such compositions typically comprise the 
nucleic acid molecule, protein, or antibody and a 
pharmaceutically acceptable carrier. As used herein the 
language "pharmaceutically acceptable carrier" is 

10 intended to include any and all solvents, dispersion 
media, coatings, antibacterial and antifungal agents, 
isotonic and absorption delaying agents, and the like, 
compatible with pharmaceutical administration. The use 
of such media and agents for pharmaceutically active 

15 substances is well known in the art. Except insofar as 
any conventional media or agent is incompatible with the 
active compound, use thereof in the compositions is 
contemplated. Supplementary active compounds can also be 
incorporated into the compositions. 

20 A pharmaceutical composition of the invention is 

formulated to be compatible with its intended route of 
administration. Examples of routes of administration 
include parenteral , e.g., intravenous , intradermal , 
subcutaneous, oral (e.g., inhalation), transdermal 

25 (topical), transmucosal , and rectal administration. 
Solutions or suspensions used for parenteral, 
intradermal, or subcutaneous application can include the 
following components: a sterile diluent such as water for 
injection, saline solution, fixed oils, polyethylene 

30 glycols, glycerine, propylene glycol or other synthetic 
solvents; antibacterial agents such as benzyl alcohol or 
methyl parabens; antioxidants such as ascorbic acid or 
sodium bisulfite; chelating agents such as 
ethylenediaminetetraacetic acid; buffers such as 

35 acetates, citrates or phosphates and agents for the 
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adjustment of tonicity such as sodium chloride or 
dextrose. pH can be adjusted with acids or bases, such 
as hydrochloric acid or sodium hydroxide. The parenteral 
preparation can be enclosed in ampoules, disposable 
5 syringes or multiple dose vials made of glass or plastic. 

Pharmaceutical compositions suitable for 
injectable use include sterile aqueous solutions (where 
water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable 

10 solutions or dispersion. For intravenous administration, 
suitable carriers include physiological saline, 
bacteriostatic water, Cremophor EL™ (BASF; Parsippany, 
NJ) or phosphate buffered saline (PBS) . In all cases, 
the composition must be sterile and should be fluid to 

15 the extent that easy syringability exists. It must be 
stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action of 
microorganisms such as bacteria and fungi. The carrier 
can be a solvent or dispersion medium containing, for 

2 0 example, water, ethanol, polyol (for example, glycerol, 
propylene glycol, and liquid polyetheylene glycol, and 
the like), and suitable mixtures thereof. The proper 
fluidity can be maintained, for example, by the use of a 
coating such as lecithin, by the maintenance of the 

25 required particle size in the case of dispersion and by 
the use of surfactants. Prevention of the action of 
microorganisms can be achieved by various antibacterial 
and antifungal agents, for example, parabens, 
chlorobutanol , phenol, ascorbic acid, thimerosal, and the 

30 like. In many cases, it will be preferable to include 
isotonic agents, for example, sugars, polyalcohols such 
as mannitol, sorbitol, sodium chloride in the 
composition. Prolonged absorption of the injectable 
compositions can be brought about by including in the 
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composition an agent which delays absorption, for 
example, aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by 
incorporating the active compound (e.g., a T139 protein 
5 or anti-T139 antibody) in the required amount in an 
appropriate solvent with one or a combination of 
ingredients enumerated above, as required, followed by 
filtered sterilization. Generally, dispersions are 
prepared by incorporating the active compound into a 

10 sterile vehicle which contains a basic dispersion medium 
and the required other ingredients from those enumerated 
above. In the case of sterile powders for the 
preparation of sterile injectable solutions, the 
preferred methods of preparation are vacuum drying and 

15 freeze-drying which yields a powder of the active 

ingredient plus any additional desired ingredient from a 
previously sterile-filtered solution thereof. 

Oral compositions generally include an inert 
diluent or an edible carrier. They can be enclosed in 

20 gelatin capsules or compressed into tablets. For the 
purpose of oral therapeutic administration, the active 
compound can be incorporated with excipients and used in 
the form of tablets, troches, or capsules. Oral 
compositions can also be prepared using a fluid carrier 

25 for use as a mouthwash, wherein the compound in the fluid 
carrier is applied orally and swished and expectorated or 
swallowed. Pharmaceutically compatible binding agents, 
and/or adjuvant materials can be included as part of the 
composition. The tablets, pills, capsules, troches and 

30 the like can contain any of the following ingredients, or 
compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an 
excipient such as starch or lactose, a disintegrating 
agent such as alginic acid, Primogel, or corn starch; a 

35 lubricant such as magnesium stearate or Sterotes; a 
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glidant such as colloidal silicon dioxide; a sweetening 
agent such as sucrose or saccharin; or a flavoring agent 
such as peppermint, methyl salicylate, or orange 
flavoring. For administration by inhalation, the 

5 compounds are delivered in the form of an aerosol spray 
from pressured container or dispenser which contains a 
suitable propellant, e.g., a gas such as carbon dioxide, 
or a nebulizer. 

Systemic administration can also be by 

10 transmucosal or transdermal means. For transmucosal or 
transdermal administration, penetrants appropriate to the 
barrier to be permeated are used in the formulation. 
Such penetrants are generally known in the art, and 
include, for example, for transmucosal administration, 

15 detergents, bile salts, and fusidic acid derivatives. 
Transmucosal administration can be accomplished through 
the use of nasal sprays or suppositories. For 
transdermal administration, the active compounds are 
formulated into ointments, salves, gels, or creams as 

20 generally known in the art. 

The compounds can also be prepared in the form of 
suppositories (e.g., with conventional suppository bases 
such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

25 In one embodiment, the active compounds are 

prepared with carriers that will protect the compound 
against rapid elimination from the body, such as a 
controlled release formulation, including implants and 
microencapsulated delivery systems. Biodegradable, 

30 biocompatible polymers can be used, such as ethylene 
vinyl acetate, polyanhydrides, polyglycolic acid, 
collagen, polyorthoesters, and polylactic acid. Methods 
for preparation of such formulations will be apparent to 
those skilled in the art. The materials can also be 

35 obtained commercially from Alza Corporation and Nova 
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Pharmaceuticals, Inc. Liposomal suspensions (including 
liposomes targeted to infected cells with monoclonal 
antibodies to viral antigens) can also be used as 
pharmaceutical ly acceptable carriers. These can be 
5 prepared according to methods known to those skilled in 
the art # for example, as described in U.S. Patent No. 
4,522,811. 

It is especially advantageous to formulate oral or 
parenteral compositions in dosage unit form for ease of 

10 administration and uniformity of dosage. Dosage unit 
form as used herein refers to physically discrete units 
suited as unitary dosages for the subject to be treated; 
each unit containing a predetermined quantity of active 
compound calculated to produce the desired therapeutic 

15 effect in association with the required pharmaceutical 
carrier. The specification for the dosage unit forms of 
the invention are dictated by and directly dependent on 
the unique characteristics of the active compound and the 
particular therapeutic effect to be achieved, and the 

20 limitations inherent in the art of compounding such an 
active compound for the treatment of individuals. 

The nucleic acid molecules of the invention can be 
inserted into vectors and used as gene therapy vectors. 
Gene therapy vectors can be delivered to a subject by, 

25 for example, intravenous injection, local administration 
(see U.S. Patent 5,328,470) or by stereotactic injection 
(see e.g., Chen et al . (1994) Proc. Natl. Acad. Sci . USA 
91:3054-3057). The pharmaceutical preparation of the 
gene therapy vector can include the gene therapy vector 

30 in an acceptable diluent, or can comprise a slow release 
matrix in which the gene delivery vehicle is imbedded. 
Alternatively, where the complete gene delivery vector 
can be produced intact from recombinant cells, e.g. 
retroviral vectors, the pharmaceutical preparation can 
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include one or more cells which produce the gene delivery 
system. 

The pharmaceutical compositions can be included in 
a container, pack, or dispenser together with 
5 instructions for administration. 



V. Uses and Methods of the Invention 

The nucleic acid molecules, proteins, protein 
homologues, and antibodies described herein can be used 
in one or more of the following methods: a) screening 

10 assays; b) detection assays (e.g., chromosomal mapping, 
tissue typing, forensic biology) , c) predictive medicine 
(e.g., diagnostic assays, prognostic assays, monitoring 
clinical trials, and pharmacogenomics) ; and d) methods of 
treatment (e.g., therapeutic and prophylactic). A T139 

15 protein interacts with other cellular proteins and can 
thus be used for (i) regulation of cellular 
proliferation; (ii) regulation of cellular 
differentiation; and (iii) regulation of cell survival. 
The isolated nucleic acid molecules of the invention can 

20 be used to express T139 protein (e.g., via a recombinant 
expression vector in a host cell in gene therapy 
applications), to detect T139 mRNA (e.g., in a biological 
sample) or a genetic lesion in a T139 gene, and to 
modulate T13 9 activity. In addition, the T139 proteins 

25 can be used to screen drugs or compounds which modulate 
the T139 activity or expression as well as to treat 
disorders characterized by insufficient or excessive 
production of T139 protein or production of T139 protein 
forms which have decreased or aberrant activity compared 

30 to T139 wild type protein. In addition, the anti-T139 
antibodies of the invention can be used to detect and 
isolate T13 9 proteins and modulate T13 9 activity. 
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This invention further pertains to novel agents 
identified by the above -described screening assays and 
uses thereof for treatments as described herein. 

A. Screening Assays 
5 The invention provides a method (also referred to 

herein as a "screening assay") for identifying 
modulators, i.e., candidate or test compounds or agents 
(e.g., peptides, peptidomimetics, small molecules or 
other drugs) which bind to T139 proteins or have a 

10 stimulatory or inhibitory effect on, for example, T139 
expression or T139 activity. 

In one embodiment, the invention provides assays 
for screening candidate or test compounds which bind to 
or modulate the activity of a T139 polypeptide or 

15 biologically active portion thereof. The test compounds 
of the present invention can be obtained using any of the 
numerous approaches in combinatorial library methods 
known in the art, including: biological libraries; 
spatially addressable parallel solid phase or solution 

20 phase libraries; synthetic library methods requiring 
deconvolution; the 'one-bead one-compound' library 
method; and synthetic library methods using affinity 
chromatography selection. The biological library 
approach is limited to peptide libraries, while the other 

25 four approaches are applicable to peptide, non-peptide 
oligomer or small molecule libraries of compounds (Lam 
(1997) Anticancer Drug Des. 12:145) . 

Examples of methods for the synthesis of molecular 
libraries can be found in the art, for example in: 

30 DeWitt et al . (1993) Proc. Natl. Acad. Sci . U.S.A. 

90:6909; Erb et al . (1994) Proc. Natl. Acad. Sci. USA 
91:11422; Zuckermann et al . (1994). J . Med. Chem. 
37:2678; Cho et al . (1993) Science 261:1303; Carrell et 
al. (1994) Angew. Chem. Int. Ed. Engl. 33:2059; Carell et 
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al. (1994) Angew. Chem. Int. Ed. Engl. 33:2061; and 
Gallop et al. (1994) J. Afed. Chem. 37:1233. 

Libraries of compounds may be presented in 
solution (e.g., Houghten (1992) Bio/Techniques 13:412- 
5 421), or on beads (Lam (1991) Mature 354 : 82-84) , chips 
(Fodor (1993) Nature 364 : 555-556) , bacteria (U.S. Patent 
No. 5,223,409), spores (Patent Nos. 5,571,698; 5,403,484; 
and 5,223,409), plasmids (Cull et al . (1992) Proc. Natl. 
Acad. Sci. USA 89:1865-1869) or on phage (Scott and Smith 
10 (1990) Science 249:386-390; Devlin (1990) Science 

249:404-406; Cwirla et al . (1990) Proc. Natl. Acad. Sci. 
87:6378-6382; and Felici (1991) J. Mol. Biol. 222:301- 
310) . 

In one embodiment, an assay is a cell-based assay 

15 in which a cell which expresses T139 protein, or a 

biologically active portion thereof, is contacted with a 
test compound and the ability of the test compound to 
bind to a T139 protein determined. The cell, for 
example, can be a yeast cell or a cell of mammalian 

20 origin. Determining the ability of the test compound to 
bind to the T139 protein can be accomplished, for 
example, by coupling the test compound with a 
radioisotope or enzymatic label such that binding of the 
test compound to the T139 protein or biologically active 

25 portion thereof can be determined by detecting the 
labeled compound in a complex. For example, test 
compounds can be labeled with 125 I, 35 S, 14 C, or 3 H, either 
directly or indirectly, and the radioisotope detected by 
direct counting of radioemmission or by scintillation 

30 counting. Alternatively, test compounds can be 

enzymatically labeled with, for example, horseradish 
peroxidase, alkaline phosphatase, or lucif erase, and the 
enzymatic label detected by determination of conversion 
of an appropriate substrate to product. In a preferred 

35 embodiment, the assay comprises contacting a cell which 
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expresses a T139 protein, or a biologically active 
portion thereof, with a known compound which binds T13 9 
to form an assay mixture, contacting the assay mixture 
with a test compound, and determining the ability of the 
5 test compound to interact with a T139 protein, wherein 
determining the ability of the test compound to interact 
with a T13 9 protein comprises determining the ability of 
the test compound to preferentially bind to T139 or a 
biologically active portion thereof as compared to the 

10 known compound. 

In another embodiment, an assay is a cell-based 
assay comprising contacting a cell expressing a T13 9 
protein, or a biologically active portion thereof, with a 
test compound and determining the ability of the test 

15 compound to modulate (e.g., stimulate or inhibit) the 
activity of the T139 protein or biologically active 
portion thereof. Determining the ability of the test 
compound to modulate the activity of T139 or a 
biologically active portion thereof can be accomplished, 

20 for example, by determining the ability of the T139 
protein to bind to or interact with a T139 target 
molecule. As used herein, a "target molecule" is a 
molecule with which a T139 protein binds or interacts in 
nature, for example, a T139 receptor, a molecule in the 

25 extracellular milieu, or a molecule associated with the 
external surface of a cell membrane. A T139 target 
molecule can be a non-T139 molecule or a T139 protein or 
polypeptide of the present invention. In one embodiment, 
a T139 target molecule is a component of a signal 

30 transduction pathway which facilitates transduction of an 
extracellular signal (e.g., a signal generated by binding 
of a T139 molecule to its receptor) through the cell 
membrane and into the cell. 

Determining the ability of the T139 protein to 

35 bind to or interact with a T139 target molecule can be 
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accomplished by one of the methods described above for 
determining direct binding. In a preferred embodiment, 
determining the ability of the T139 protein to bind to or 
interact with a T139 target molecule can be accomplished 
5 by determining the activity of the target molecule. For 
example, the activity of the target molecule can be 
determined by detecting induction of a cellular second 
messenger of the target (e.g., intracellular Ca 2+ , 
diacylglycerol , IP3, etc.), detecting catalytic/enzymatic 

10 activity of the target an appropriate substrate, 

detecting the induction of a reporter gene (e.g., a T139- 
responsive regulatory element operatively linked to a 
nucleic acid encoding a detectable marker, e.g. 
luciferase) , or detecting a cellular response, for 

15 example, cell survival, cellular differentiation, or cell 
prol i f e r a t i on . 

In yet another embodiment, an assay of the present 
invention is a cell-free assay comprising contacting a 
T139 protein or biologically active portion thereof with 

20 a test compound and determining the ability of the test 
compound to bind to the T139 protein or biologically 
active portion thereof. Binding of the test compound to 
the T13 9 protein can be determined either directly or 
indirectly as described above. In a preferred 

25 embodiment, the assay includes contacting the T139 
protein or biologically active portion thereof with a 
known compound which binds T139 to form an assay mixture, 
contacting the assay mixture with a test compound, and 
determining the ability of the test compound to interact 

30 with a T139 protein, wherein determining the ability of 
the test compound to interact with a T13 9 protein 
comprises determining the ability of the test compound to 
preferentially bind to T139 or biologically active 
portion thereof as compared to the known compound. 
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In another embodiment, an assay is a cell -free 
assay comprising contacting T139 protein or biologically 
active portion thereof with a test compound and 
determining the ability of the test compound to modulate 
5 (e.g., stimulate or inhibit) the activity of the T139 
protein or biologically active portion thereof. 
Determining the ability of the test compound to modulate 
the activity of T13 9 can be accomplished, for example, by 
determining the ability of the T13 9 protein to bind to a 

10 T139 target molecule by one of the methods described 

above for determining direct binding. In an alternative 
embodiment, determining the ability of the test compound 
to modulate the activity of T139 can be accomplished by 
determining the ability of the T139 protein further 

15 modulate a T139 target molecule. For example, the 

catalytic/enzymatic activity of the target molecule on an 
appropriate substrate can be determined as previously 
described. 

In yet another embodiment, the cell -free assay 

2 0 comprises contacting the T139 protein or biologically 

active portion thereof with a known compound which binds 
T139 to form an assay mixture, contacting the assay 
mixture with a test compound, and determining the ability 
of the test compound to interact with a T139 protein, 

25 wherein determining the ability of the test compound to 
interact with a T139 protein comprises determining the 
ability of the T139 protein to preferentially bind to or 
modulate the activity of a T139 target molecule. 

In some cell -based assays, it may be desirable to 

30 utilize a solubilizing agent such that a membrane -bound 
form of T139 is maintained in solution. Examples of such 
solubilizing agents include non-ionic detergents such as 
n-octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, 
octanoyl -N-methylglucamide , decanoyl -N-methylglucamide , 

35 Triton® X-100, Triton® X-114, Thesit®, 
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I sotridecypoly (ethylene glycol ether)n, 3-[(3- 
cholamidopropyl)dimethylamminio] -1-propane sulfonate 
(CHAPS) , 3 - [ (3-cholamidopropyl)dimethylamminio] -2- 
hydroxy- 1-propane sulfonate (CHAPSO) , or N-dodecyl=N,N- 
5 dimethyl-3-ammonio-l-propane sulfonate. 

In more than one embodiment of the above assay 
methods of the present invention, it may be desirable to 
immobilize either T139 or its target molecule to 
facilitate separation of complexed from uncomplexed forms 

10 of one or both of the proteins, as well as to accommodate 
automation of the assay. Binding of a test compound to 
T13 9, or interaction of T139 with a target molecule in 
the presence and absence of a candidate compound, can be 
accomplished in any vessel suitable for containing the 

15 reactants. Examples of such vessels include microtitre 
plates, test tubes, and micro- centrifuge tubes. In one 
embodiment, a fusion protein can be provided which adds a 
domain that allows one or both of the proteins to be 
bound to a matrix. For example, glutathione- S- 

20 transferase/T139 fusion proteins or glutathione-S- 

transferase/target fusion proteins can be adsorbed onto 
glutathione sepharose beads (Sigma Chemical; St. Louis, 
MO) or glutathione derivatized microtitre plates, which 
are then combined with the test compound or the test 

25 compound and either the non- adsorbed target protein or 
T13 9 protein, and the mixture incubated under conditions 
conducive to complex formation (e.g., at physiological 
conditions for salt and pH) . Following incubation, the 
beads or microtitre plate wells are washed to remove any 

30 unbound components, the matrix immobilized in the case of 
beads, complex determined either directly or indirectly, 
for example, as described above. Alternatively, the 
complexes can be dissociated from the matrix, and the 
level of T139 binding or activity determined using 

35 standard techniques. 
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protein expression. Alternatively, when expression of 
T139 mRNA or protein is less (statistically significantly 
less) in the presence of the candidate compound than in 
its absence, the candidate compound is identified as an 
5 inhibitor of T139 mRNA or protein expression. The level 
of T13 9 mRNA or protein expression in the cells can be 
determined by methods described herein for detecting T139 
mRNA or protein. 

In yet another aspect of the invention, the T139 

10 proteins can be used as "bait proteins" in a two-hybrid 
assay or three hybrid assay (see, e.g., U.S. Patent No. 
5,283,317; Zervos et al . (1993) Cell 72:223-232; Madura 
et al- (1993) J". Biol. Chew. 268:12046-12054; Bartel et 
al. (1993) Bio/Techniques 14:920-924; Iwabuchi et al . 

15 (1993) Oncogene 8:1693-1696; and WO 94/10300), to 

identify other proteins, which bind to or interact with 
T139 ("T13 9 -binding proteins" or "T139-bp") and modulate 
T139 activity. Such T139-binding proteins are also 
likely to be involved in the propagation of signals by 

20 the T139 proteins as, for example, upstream or downstream 
elements of the T139 pathway. 

The two-hybrid system is based on the modular 
nature of most transcription factors, which consist of 
separable DNA-binding and activation domains. Briefly, 

25 the assay utilizes two different DNA constructs. In one 
construct, the gene that codes for T139 is fused to a 
gene encoding the DNA binding domain of a known 
transcription factor (e.g., GAL-4) . In the other 
construct, a DNA sequence, from a library of DNA 

30 sequences, that encodes an unidentified protein ("prey" 
or "sample") is.fused to a gene that codes for the 
activation domain of the known transcription factor. If 
the "bait" and the "prey" proteins are able to interact, 
in vivo, forming an T13 9 -dependent complex, the DNA- 

35 binding and activation domains of the transcription 
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factor are brought into close proximity. This proximity- 
allows transcription of a reporter gene (e.g., LacZ) 
which is operably linked to a transcriptional regulatory 
site responsive to the transcription factor. Expression 
5 of the reporter gene can be detected and cell colonies 
containing the functional transcription factor can be 
isolated and used to obtain the cloned gene which encodes 
the protein which interacts with T139. 

This invention further pertains to novel agents 
10 identified by the above -described screening assays and 
uses thereof for treatments as described herein. 

B. Detection Assays 

Portions or fragments of the cDNA sequences 
identified herein (and the corresponding complete gene 

15 sequences) can be used in numerous ways as polynucleotide 
reagents. For example, these sequences can be used to: 
(i) map their respective genes on a chromosome; and, 
thus, locate gene regions associated with genetic 
disease; (ii) identify an individual from a minute 

20 biological sample (tissue typing) ; and (iii) aid in 

forensic identification of a biological sample. These 
applications are described in the subsections below. 

l . Chromosome Mapping 

Once the sequence (or a portion of the sequence) 
25 of a gene has been isolated, this sequence can be used to 
map the location of the gene on a chromosome. 
Accordingly, T139 nucleic acid molecules described herein 
or fragments thereof, can be used to map the location of 
T139 genes on a chromosome. The mapping of the T13 9 
30 sequences to chromosomes is an important first step in 
correlating these sequences with genes associated with 
disease. 
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Briefly, T139 genes can be mapped to chromosomes 
by preparing PCR primers (preferably 15-25 bp in length) 
from the T139 sequences. Computer analysis of T139 
sequences can be used to rapidly select primers that do 
5 not span more than one exon in the genomic DNA, thus 
complicating the amplification process. These primers 
can then be used for PCR screening of somatic cell 
hybrids containing individual human chromosomes. Only 
those hybrids containing the human gene corresponding to 

10 the T139 sequences will yield an amplified fragment. 

Somatic cell hybrids are prepared by fusing 
somatic cells from different mammals (e.g., human and 
mouse cells) . As hybrids of human and mouse cells grow 
and divide, they gradually lose human chromosomes in 

15 random order, but retain the mouse chromosomes. By using 
media in which mouse cells cannot grow, because they lack 
a particular enzyme, but human cells can, the one human 
chromosome that contains the gene encoding the needed 
enzyme, will be retained. By using various media, panels 

2 0 of hybrid cell lines can be established. Each cell line 
in a panel contains either a single human chromosome or a 
small number of human chromosomes, and a full set of 
mouse chromosomes, allowing easy mapping of individual 
genes to specific human chromosomes. (D'Eustachio et al . 

25 (1983) Science 220:919-924). Somatic cell hybrids 

containing only fragments of human chromosomes can also 
be produced by using human chromosomes with 
translocations and deletions. 

PCR mapping of somatic cell hybrids is a rapid 

30 procedure for assigning a particular sequence to a 

particular chromosome. Three or more sequences can be 
assigned per day using a single thermal cycler. Using 
the T139 sequences to design oligonucleotide primers, 
sublocalization can be achieved with panels of fragments 

35 from specific chromosomes. Other mapping strategies 
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which can similarly be used to map a T139 sequence to its 
chromosome include in situ hybridization (described in 
Fan et al . (1990) Proc. Natl. Acad. Sci. USA 87:6223-27) , 
pre-screening with labeled flow-sorted chromosomes, and 
5 pre-selection by hybridization to chromosome specific 
cDNA libraries. 

Fluorescence in situ hybridization (FISH) of a DNA 
sequence to a metaphase chromosomal spread can further be 
used to provide a precise chromosomal location in one 

10 step. Chromosome spreads can be made using cells whose 
division has been blocked in metaphase by a chemical like 
colcemid that disrupts the mitotic spindle. The 
chromosomes can be treated briefly with trypsin, and then 
stained with Giemsa. A pattern of light and dark bands 

15 develops on each chromosome, so that the chromosomes can 
be identified individually. The FISH technique can be 
used with a DNA sequence as short as 500 or 600 bases. 
However, clones larger than 1,000 bases have a higher 
likelihood of binding to a unique chromosomal location 

20 with sufficient signal intensity for simple detection. 
Preferably 1,000 bases, and more preferably 2,000 bases 
will suffice to get good results at a reasonable amount 
of time. For a review of this technique, see Verma et 
al . , Human Chromosomes : A Manual of Basic Techniques 

25 (Pergamon Press, New York, 1988) . 

Reagents for chromosome mapping can be used 
individually to mark a single chromosome or a single site 
on that chromosome, or panels of reagents can be used for 
marking multiple sites and/or multiple chromosomes. 

30 Reagents corresponding to noncoding regions of the genes 
actually are preferred for mapping purposes. Coding 
sequences are more likely to be conserved within gene 
families, thus increasing the chance of cross 
hybridizations during chromosomal mapping. 
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Once a sequence has been mapped to a precise 
chromosomal location, the physical position of the 
sequence on the chromosome can be correlated with genetic 
map data. (Such data are found, for example, in V. 
5 McKusick, Mendelian Inheritance in Man, available on-line 
through Johns Hopkins University Welch Medical Library) . 
The relationship between genes and disease, mapped to the 
same chromosomal region, can then be identified through 
linkage analysis (co- inheritance of physically adjacent 

10 genes), described in, e.g., Egeland et al. (1987) Nature, 
325:783-787. - 

Moreover, differences in the DNA sequences between 
individuals affected and unaffected with a disease 
associated with the T13 9 gene can be determined. If a 

15 mutation is observed in some or all of the affected 

individuals but not in any unaffected individuals, then 
the mutation is likely to be the causative agent of the 
particular disease. Comparison of affected and 
unaffected individuals generally involves first looking 

20 for structural alterations in the chromosomes such as 
deletions or translocations that are visible from 
chromosome spreads or detectable using PCR based on that 
DNA sequence. Ultimately, complete sequencing of genes 
from several individuals can be performed to confirm the 

25 presence of a mutation and to distinguish mutations from 
polymorphisms . 

2. Tissue Typing 

The T139 sequences of the present invention can 
also be used to identify individuals from minute 
30 biological samples. The United States military, for 
example, is considering the use of restriction fragment 
length polymorphism (RFLP) for identification of its 
personnel. In this technique, an individual's genomic 
DNA is digested with one or more restriction enzymes, and 
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probed on a Southern blot to yield unique bands for 
identification. This method does not suffer from the 
current limitations of "Dog Tags" which can be lost, 
switched, or stolen, making positive identification 
5 difficult. The sequences of the present invention are 
useful as additional DNA markers for RFLP (described in 
U.S. Patent 5,272,057). 

Furthermore, the sequences of the present 
invention can be used to provide an alternative technique 

10 which determines the actual base-by-base DNA sequence of 
selected portions of an individual's genome. Thus, the 
T13 9 sequences described herein can be used to prepare 
two PGR primers from the 5' and 3' ends of the sequences. 
These primers can then be used to amplify an individual's 

15 DNA and subsequently sequence it. 

Panels of corresponding DNA sequences from 
individuals, prepared in this manner, can provide unique 
individual identifications, as each individual will have 
a unique set of such DNA sequences due to allelic 

20 differences. The sequences of the present invention can 
be used to obtain such identification sequences from 
individuals and from tissue. The T139 sequences of the 
invention uniquely represent portions of the human 
genome. Allelic variation occurs to some degree in the 

25 coding regions of these sequences, and to a greater 
degree in the noncoding regions. It is estimated that 
allelic variation between individual humans occurs with a 
frequency of about once per each 500 bases. Each of the 
sequences described herein can, to some degree, be used 

30 as a standard against which DNA from an individual can be 
compared for identification purposes. Because greater 
numbers of polymorphisms occur in the noncoding regions, 
fewer sequences are necessary to differentiate 
individuals. The noncoding sequences of SEQ ID N0:1 can 

35 comfortably provide positive individual identification 
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with a panel of perhaps 10 to 1,000 primers which each 
yield a noncoding amplified sequence of 100 bases. If 
predicted coding sequences, such as those in SEQ ID NO: 3 
are used, a more appropriate number of primers for 
5 positive individual identification would be 500-2,000. 
If a panel of reagents from T13 9 sequences 
described herein is used to generate a unique 
identification database for an individual, those same 
reagents can later be used to identify tissue from that 
10 individual. Using the unique identification database, 
positive identification of the individual, living or 
dead, can be made from extremely small tissue samples. 

^. Use of Partial T139 Sequences in Forensic 

Biology 

15 DNA-based identification techniques can also be 

used in forensic biology. Forensic biology is a 
scientific field employing genetic typing of biological 
evidence found at a crime scene as a means for positively 
identifying, for example, a perpetrator of a crime. To 

20 make such an identification, PCR technology can be used 
to amplify DNA sequences taken from very small biological 
samples such as tissues, e.g., hair or skin, or body 
fluids, e.g., blood, saliva, or semen found at a crime 
scene. The amplified sequence can then be compared to a 

25 standard, thereby allowing identification of the origin 
of the biological sample. 

The sequences of the present invention can be used 
to provide polynucleotide reagents, e.g., PCR primers, 
targeted to specific loci in the human genome, which can 

30 enhance the reliability of DNA-based forensic 

identifications by, for example, providing another 
"identification marker" {i.e. another DNA sequence that 
is unique to a particular individual) . As mentioned 
above, actual base sequence information can be used for 
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identification as an accurate alternative to patterns 
formed by restriction enzyme generated fragments. 
Sequences targeted to noncoding regions of SEQ ID N0:1 
are particularly appropriate for this use as greater 
5 numbers of polymorphisms occur in the noncoding regions, 
making it easier to differentiate individuals using this 
technique. Examples of polynucleotide reagents include 
the T139 sequences or portions thereof, e.g., fragments 
derived from the noncoding regions of SEQ ID N0:1 having 

10 a length of at least 20 or 30 bases. 

The T139 sequences described herein can,, further be 
used to provide polynucleotide reagents, e.g., labeled or 
labelable probes which can be used in, for example, an in 
situ hybridization technique, to identify a specific 

15 tissue, e.g., brain tissue. This can be very useful in 
cases where a forensic pathologist is presented with a 
tissue of unknown origin. Panels of such T139 probes can 
be used to identify tissue by species and/or by organ 
type. 

20 In a similar fashion, these reagents, e.g., T139 

primers or probes can be used to screen tissue culture 
for contamination (i.e., screen for the presence of a 
mixture of different types of cells in a culture) . 

C, Predictive Medicine 

25 The present invention also pertains to the field 

of predictive medicine in which diagnostic assays, 
prognostic assays, pharmacogenomics, and monitoring 
clinical trails are used for prognostic (predictive) 
purposes to thereby treat an individual prophylactically . 

30 Accordingly, one aspect of the present invention relates 
to diagnostic assays for determining T139 protein and/or 
nucleic acid expression as well as T13 9 activity, in the 
context of a biological sample (e.g., blood, serum, 
cells, tissue) to thereby determine whether an individual 
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is afflicted with a disease or disorder, or is at risk of 
developing a disorder, associated with aberrant T139 
expression or activity. The invention also provides for 
prognostic (or predictive) assays for determining whether 
5 an individual is at risk of developing a disorder 

associated with T13 9 protein, nucleic acid expression or 
activity. For example, mutations in a T139 gene can be 
assayed in a biological sample. Such assays can be used 
for prognostic or predictive purpose to thereby 

10 prophylactically treat an individual prior to the onset 
of a disorder characterized by or associated- with T139 
protein, nucleic acid expression or activity. 

Another aspect of the invention provides methods 
for determining T139 protein, nucleic acid expression or 

15 T13 9 activity in an individual to thereby select 

appropriate therapeutic or prophylactic agents for that 
individual (referred to herein as "pharmacogenomics") . 
Pharmacogenomics allows for the selection of agents 
(e.g., drugs) for therapeutic or prophylactic treatment 

20 of an individual based on the genotype of the individual 
(e.g., the genotype of the individual examined to 
determine the ability of the individual to respond to a 
particular agent.) 

Yet another aspect of the invention pertains to 

25 monitoring the influence of agents (e.g., drugs or other 
compounds) on the expression or activity of T139 in 
clinical trials. 

These and other agents are described in further 
detail in the following sections. 

30. 1. Diagnostic Assays 

An exemplary method for detecting the presence or 
absence of Tl 3 9 in a biological sample involves obtaining 
a biological sample from a test subject and contacting 
the biological sample with a compound or an agent capable 
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as well as in vivo. For example, in vitro techniques for 
detection of T139 mRNA include Northern hybridizations 
and in situ hybridizations. In vitro techniques for 
detection of T139 protein include enzyme linked 
5 immunosorbent assays (ELlSAs) , Western blots, 

immunoprecipitations and immunofluorescence. In vitro 
techniques for detection of T139 genomic DNA include 
Southern hybridizations. Furthermore, in vivo techniques 
for detection of T139 protein include introducing into a 

10 subject a labeled anti-T139 antibody. For example, the 
antibody can be labeled with a radioactive marker whose 
presence and location in a subject can be detected by 
standard imaging techniques. 

In one embodiment, the biological sample contains 

15 protein molecules from the test subject. Alternatively, 
the biological sample can contain mRNA molecules from the 
test subject or genomic DNA molecules from the test 
subject. A preferred biological sample is a peripheral 
blood leukocyte sample isolated by conventional means 

2 0 from a subject. 

In another embodiment, the methods further involve 
obtaining a control biological sample from a control 
subject, contacting the control sample with a compound or 
agent capable of detecting T139 protein, mRNA, or genomic 

25 DNA, such that the presence of T139 protein, mRNA or 
genomic DNA is detected in the biological sample, and 
comparing the presence of T139 protein, mRNA or genomic 
DNA in the control sample with the presence of T139 
protein, mRNA or genomic DNA in the test sample. 

30 The invention also encompasses kits for detecting 

the presence of T139 in a biological sample (a test 
sample) . Such kits can be used to determine if a subject 
is suffering from or is at increased risk of developing a 
disorder associated with aberrant expression of T139. 

3 5 For example, the kit can comprise a labeled compound or 
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agent capable of detecting T13 9 protein or mRNA in a 
biological sample and means for determining the amount of 
T139 in the sample (e.g., an anti-T139 antibody or an 
oligonucleotide probe which binds to DNA encoding T139, 
5 e.g., SEQ ID N0:1 or SEQ ID NO: 3) . Kits may also include 
instruction for observing that the tested subject is 
suffering from or is at risk of developing a disorder 
associated with aberrant expression of T13 9 if the amount 
of T139 protein or mRNA is above or below a normal level. 

10 For antibody-based kits, the kit may comprise, for 

example: (1) a first antibody (e.g., attached to a solid 
support) which binds to T139 protein; and, optionally, 
(2) a second, different antibody which binds to T139 
protein or the first antibody and is conjugated to a 

15 detectable agent. 

For oligonucleotide -based kits, the kit may 
comprise, for example: (1) a oligonucleotide, e.g., a 
detectably labelled oligonucleotide, which hybridizes to 
a T139 nucleic acid sequence or (2) a pair of primers 

20 useful for amplifying a T139 nucleic acid molecule. 

The kit may also comprise, e.g./ a buffering 
agent, a preservative, or a protein stabilizing agent. 
The kit may also comprise components necessary for 
detecting the detectable agent (e.g., an enzyme or a 

25 substrate) . The kit may also contain a control sample or 
a series of control samples which can be assayed and 
compared to the test sample contained. Each component of 
the kit is usually enclosed within an individual 
container and all of the various containers are within a 

30 single package along with instructions for observing 
whether the tested subject is suffering from or is at 
risk of developing a disorder associated with aberrant 
expression or activity of T139. 
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2 . Prognostic Assays 

The methods described herein can furthermore be 
utilized as diagnostic or prognostic assays to identify 
subjects having or at risk of developing a disease or 
5 disorder associated with aberrant T13 9 expression or 

activity. For example, the assays described herein, such 
as the preceding diagnostic assays or the following 
assays, can be utilized to identify a subject having or 
at risk of developing a disorder associated with T139 

10 protein, nucleic acid expression or activity. 

Alternatively, the prognostic assays can --be^ utilized to 
identify a subject having or at risk for developing such 
a disease or disorder. Thus, the present invention 
provides a method in which a test sample is obtained from 

15 a subject and T139 protein or nucleic acid (e.g., mRNA, 
genomic DNA) is detected, wherein the presence of T139 
protein or nucleic acid is diagnostic for a subject 
having or at risk of developing a disease or disorder 
associated with aberrant T139 expression or activity. As 

20 used herein, a "test sample" refers to a biological 

sample obtained from a subject of interest. For example, 
a test sample can be a biological fluid (e.g., serum), 
cell sample, or tissue. 

Furthermore, the prognostic assays described 

25 herein can be used to determine whether a subject can be 
administered an agent (e.g., an agonist, antagonist, 
peptidomimetic, protein, peptide, nucleic acid, small 
molecule, or other drug candidate) to treat a disease or 
disorder associated with aberrant T13 9 expression or 

30 activity. For example, such methods can be used to 

determine whether a subject can be effectively treated 
with a specific agent or class of agents (e.g., agents of 
a type which decrease T13 9 activity) . Thus, the present 
invention provides methods for determining whether a 

35 subject can be effectively treated with an agent for a 
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disorder associated with aberrant T139 expression or 
activity in which a test sample is obtained and T13 9 
protein or nucleic acid is detected (e.g., wherein the 
presence of T139 protein or nucleic acid is diagnostic 
5 for a subject that can be administered the agent to treat 
a disorder associated with aberrant T139 expression or 
activity) . 

The methods of the invention can also be used to 
detect genetic lesions or mutations in a T139 gene, 

10 thereby determining if a subject with the lesioned gene 
is at risk for a disorder characterized by aberrant cell 
proliferation and/or differentiation. In preferred 
embodiments, the methods include detecting, in a sample 
of cells from the subject, the presence or absence of a 

15 genetic lesion characterized by at least one of an 

alteration affecting the integrity of a gene encoding a 
T139~protein, or the mis- expression of the T139 gene. 
For example, such genetic lesions can be detected by 
ascertaining the existence of at least one of 1) a 

20 deletion of one or more nucleotides from a T13 9 gene; 2) 
an addition of one or more nucleotides to a T139 gene; 3) 
a substitution of one or more nucleotides of a T139 gene, 
4) a chromosomal rearrangement of a T139 gene; 5) an 
alteration in the level of a messenger RNA transcript of 

25 a T139 gene, 6) aberrant modification of a T139 gene, 

such as of the methylation pattern of the genomic DNA, 7) 
the presence of a non-wild type splicing pattern of a 
messenger RNA transcript of a T13 9 gene, 8) a non-wild 
type level of a T139-protein, 9) allelic loss of a T139 

30 gene, and 10) inappropriate post-translational 

modification of a T139-protein. As described herein, 
there are a large number of assay techniques known in the 
art which can be used for detecting lesions in a T13 9 
gene. A preferred biological sample is a peripheral 
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blood leukocyte sample isolated by conventional means 
from a subject. 

In certain embodiments, detection of the lesion 
involves the use of a probe/primer in a polymerase chain 
5 reaction (PCR) (see, e.g., U.S. Patent Nos. 4,683,195 and 
4,683,202), such as anchor PCR or RACE PCR, or, 
alternatively, in a ligation chain reaction (LCR) (see, 
e.g., Landegran et al. (1988) Science 241:1077-1080; and 
Nakazawa et al . (1994) Proc. Natl. Acad. Sci. USA 91:360- 

10 364) , the latter of which can be particularly useful for 
detecting point mutations in the T139-gene, (see Abravaya 
et al. (1995) Nucleic Acids Res. 23:675-682). This 
method can include the steps of collecting a sample of 
cells from a patient, isolating nucleic acid (e.g., 

15 genomic, mRNA or both) from the cells of the sample, 
contacting the nucleic acid sample with one or more 
primers which specifically hybridize to a T139 gene under 
conditions such that hybridization and amplification of 
the T139-gene (if present) occurs, and detecting the 

20 presence or absence of an amplification product, or 
detecting the size of the amplification product and 
comparing the length to a control sample. It is 
anticipated that PCR and/or LCR may be desirable to use 
as a preliminary amplif ication step in conjunction with 

25 any of the techniques used for detecting mutations 
described herein. 

Alternative amplification methods include: self 
sustained sequence replication (Guatelli et al. (1990) 
Proc. Natl. Acad. Sci. USA 87:1874-1878), transcriptional 

30 amplification system (Kwoh, et al. (1989) Proc. Natl. 
Acad. Sci. USA 86:1173-1177), Q-Beta Replicase (Lizardi 
et al. (1988) Bio/Technology 6:1197), or any other 
nucleic acid amplification method, followed by the 
detection of the amplified molecules using techniques 

35 well known to those of skill in the art. These detection 
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schemes are especially useful for the detection of 
nucleic acid molecules if such molecules are present in 
very low numbers. 

In an alternative embodiment, mutations in a T139 
5 gene from a sample cell can be identified by alterations 
in restriction enzyme cleavage patterns. For example, 
sample and control DNA is isolated, amplified 
(optionally) , digested with one or more restriction 
endonucleases, and fragment length sizes are determined 

10 by gel electrophoresis and compared. Differences in 
fragment length sizes between sample and -control DNA 
indicates mutations in the sample DNA. Moreover, the use 
of sequence specific ribozymes (see, e.g., U.S. Patent 
No. 5,498,531) can be used to score for the presence of 

15 specific mutations by development or loss of a ribozyme 
cleavage site. 

In other embodiments, genetic mutations in T139 
can be identified by hybridizing a sample and control 
nucleic acids, e.g., DNA or RNA, to high density arrays 

20 containing hundreds or thousands of oligonucleotides 
probes (Cronin et al . (1996) Human Mutation 7:244-255; 
Kozal et al . (1996) Nature Medicine 2:753-759). For 
example, genetic mutations in T139 can be identified in 
two-dimensional arrays containing light -generated DNA 

25 probes as described in Cronin et al . supra. Briefly, a 
first hybridization array of probes can be used to scan 
through long stretches of DNA in a sample and control to 
identify base changes between the sequences by making 
linear arrays of sequential overlapping probes. This 

30 step allows the identification of point mutations. This 
step is followed by a second hybridization array that 
allows the characterization of specific mutations by 
using smaller, specialized probe arrays complementary to 
all variants or mutations detected. Each mutation array 

35 is composed of parallel probe sets, one complementary to 
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the wild- type gene and the other complementary to the 

mutant gene. 

In yet another embodiment, any of a variety of 
sequencing reactions known in the art can be used to 
5 directly sequence the T139 gene and detect mutations by 
comparing the sequence of the sample T13 9 with the 
corresponding wild- type (control) sequence. Examples of 
sequencing reactions include those based on techniques 
developed by Maxim and Gilbert ((1977) Proc. Natl. Acad. 

10 Sci. USA 74:560) or Sanger ((1977) Proc. Natl. Acad. 

Sci. USA 74:5463). It is also contemplated that any of a 
variety of automated sequencing procedures can be 
utilized when performing the diagnostic assays ( (1995) 
Bio/Techniques 19:448), including sequencing by mass 

15 spectrometry (see, e.g., PCT Publication No. WO 94/16101; 
Cohen et al. (1996) Adv. Chromatogr. 36:127-162; and 
Griffin et al . (1993) Appl. Biochem. Biotechnol. 38:147- 
159) . 

Other methods for detecting mutations in the T139 

2 0 gene include methods in which protection from cleavage 
agents is used to detect mismatched bases in RNA/RNA or 
RNA/DNA heteroduplexes (Myers et al. (1985) Science 
230:1242) . In general, the art technique of "mismatch 
cleavage" starts by providing heteroduplexes of formed by 

25 hybridizing (labeled) RNA or DNA containing the wild-type 
T139 sequence with potentially mutant RNA or DNA obtained 
from a tissue sample. The double -stranded duplexes are 
treated with an agent which cleaves single-stranded 
regions of the duplex such as which will exist due to 

30 basepair mismatches between the control and sample 

strands. For instance, RNA/DNA duplexes can be treated 
with RNase and DNA/ DNA hybrids treated with SI nuclease 
to enzymatically digesting the mismatched regions. In 
other embodiments, either DNA/ DNA or RNA/DNA duplexes can 

35 be treated with hydroxylamine or osmium tetroxide and 
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with piperidine in order to digest mismatched regions. 
After digestion of the mismatched regions, the resulting 
material is then separated by size on denaturing 
polyacrylamide gels to determine the site of mutation. 
5 See, e.g./ Cotton et al (1988) Proc. Natl Acad Sci USA 
85:4397; Saleeba et al (1992) Methods Enzymol. 217:286- 
295. In a preferred embodiment, the control DNA or RNA 
can be labeled for detection. 

In still another embodiment, the mismatch cleavage 

10 reaction employs one or more proteins that recognize 
mismatched base pairs in double- stranded- DNA (so called 
tt DNA mismatch repair" enzymes) in defined systems for 
detecting and mapping point mutations in T139 cDNAs 
obtained from samples of cells. For example, the mutY 

15 enzyme of E. coli cleaves A at G/A mismatches and the 
thymidine DNA glycosylase from HeLa cells cleaves T at 
G/T mismatches (Hsu et al. (1994) Carcinogenesis 15:1657- 
1662) . According to an exemplary embodiment, a probe 
based on a T139 sequence, e.g., a wild-type T139 

2 0 sequence, is hybridized to a cDNA or other DNA product 
from a test cell(s) . The duplex is treated with a DNA 
mismatch repair enzyme, and the cleavage products, if 
any, can be detected from electrophoresis protocols or 
the like. See, e.g., U.S. Patent No. 5,459,039. 

25 In other embodiments, alterations in 

electrophoretic mobility will be used to identify 
mutations in T139 genes. For example, single strand 
conformation polymorphism (SSCP) may be used to detect 
differences in electrophoretic mobility between mutant 

30 and wild type nucleic acids (Orita et al . (1989) Proc 
Natl. Acad. Sci USA: 86:2766; see also Cotton (1993) 
Mutat. Res. 285:125-144; and Hayashi (1992) Genet Anal 
Tech Appl 9:73-79) . Single -stranded DNA fragments of 
sample and control T13 9 nucleic acids will be denatured 

35 and allowed to renature. The secondary structure of 
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single- stranded nucleic acids varies according to 
sequence, the resulting alteration in electrophoretic 
mobility enables the detection of even a single base 
change- The DNA fragments may be labeled or detected 
5 with labeled probes. The sensitivity of the assay may be 
enhanced by using RNA (rather than DNA) , in which the 
secondary structure is more sensitive to a change in 
sequence. In a preferred embodiment, the subject method 
utilizes heteroduplex analysis to separate double 
10 stranded heteroduplex molecules on the basis of changes 
in electrophoretic mobility (Keen et--ai-;-- (1991) Trends 
Genet 7:5). 

In yet another embodiment , the movement of mutant 
or wild- type fragments in polyacryl amide gels containing 

15 a gradient of denaturant is assayed using denaturing 

gradient gel electrophoresis (DGGE) (Myers et al . (1985) 
Nature 313:495) . When DGGE is used as the method of 
analysis, DNA will be modified to insure that it does not 
completely denature, for example by adding a GC clamp of 

20 approximately 40 bp of high-melting GC-rich DNA by PCR. 
In a further embodiment, a temperature gradient is used 
in place of a denaturing gradient to identify differences 
in the mobility of control and sample DNA (Rosenbaum and 
Reissner (1987) Biophys Chem 265:12753). 

25 Examples of other techniques for detecting point 

mutations include, but are not limited to, selective 
oligonucleotide hybridization, selective amplification, 
or selective primer extension. For example, 
oligonucleotide primers may be prepared in which the 

30 known mutation is placed centrally and then hybridized to 
target DNA under conditions which permit hybridization 
only if a perfect match is found (Saiki et al . (1986) 
Nature 324:163); Saiki et al . (1989) Proc. Natl Acad. Sci 
USA 86:6230). Such allele specific oligonucleotides are 

35 hybridized to PCR amplified target DNA or a number of 
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different mutations when the oligonucleotides are 
attached to the hybridizing membrane and hybridized with 
labeled target DNA. 

Alternatively, allele specific amplification 
5 technology which depends on selective PCR amplification 
may be used in conjunction with the instant invention. 
Oligonucleotides used as primers for specific 
amplification may carry the mutation of interest in the 
center of the molecule (so that amplification depends on 

10 differential hybridization) (Gibbs et al. (1989) Nucleic 
Acids Res. 17:2437-2448) or at the extreme 3' end of one 
primer where, under appropriate conditions, mismatch can 
prevent, or reduce polymerase extension (Prossner (1993) 
Tibtech 11:238). In addition, it may be desirable to 

15 introduce a novel restriction site in the region of the 
mutation to create cleavage-based detection (Gasparini et 
al. (1992) Mol. Cell Probes 6:1). It is anticipated that 
in certain embodiments amplification may also be 
performed using Taq ligase for amplification (Barany 

20 (1991) Proc. Natl. Acad. Sci USA 88:189). In such cases, 
ligation will occur only if there is a perfect match at 
the 3' end of the 5' sequence making it possible to 
detect the presence of a known mutation at a specific 
site by looking for the presence or absence of 

25 amplification. 

The methods described herein may be performed, for 
example, by utilizing pre-packaged diagnostic kits 
comprising at least one probe nucleic acid or antibody 
reagent described herein, which may be conveniently used, 

30 e.g., in clinical settings to diagnose patients 

exhibiting symptoms or family history of a disease or 
illness involving a T139 gene. 

Furthermore, any cell type or tissue, preferably 
peripheral blood leukocytes, in which T13 9 is expressed 
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transmitted as a single factor altering the way drugs act 
on the body (altered drug action) or genetic conditions 
transmitted as single factors altering the way the body 
acts on drugs (altered drug metabolism) . These 
5 pharmacogenetic conditions can occur either as rare 
defects or as polymorphisms. For example, glucose-6- 
phosphate dehydrogenase deficiency (G6PD) is a common 
inherited enzymopathy in which the main clinical 
complication is haemolysis after ingestion of oxidant 

10 drugs (anti-malarials, sulfonamides, analgesics, 
nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of 
drug metabolizing enzymes is a major determinant of both 
the intensity and duration of drug action. The discovery 

15 of genetic polymorphisms of drug metabolizing enzymes 

(e.g., N-acetyltransferase 2 (NAT 2) and cytochrome P450 
enzymes CYP2D6 and CYP2C19) has provided an explanation 
as to why some patients do not obtain the expected drug 
effects or show exaggerated drug response and serious 

20 toxicity after taking the standard and safe dose of a 
drug. These polymorphisms are expressed in two 
phenotypes in the population, the extensive metabolizer 
(EM) and poor metabolizer (PM) . The prevalence of PM is 
different among different populations. For example, the 

25 gene coding for CYP2D6 is highly polymorphic and several 
mutations have been identified in PM, which all lead to 
the absence of functional CYP2D6 . Poor metabolizers of 
CYP2D6 and CYP2C19 quite frequently experience 
exaggerated drug response and side effects when they 

30 receive standard doses. If a metabolite is the active 
therapeutic moiety, PM show no therapeutic response, as 
demonstrated for the analgesic effect of codeine mediated 
by its CYP2D6-formed metabolite morphine. The other 
extreme are the so called ultra-rapid metabolizers who do 

35 not respond to standard doses. Recently, the molecular 
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basis of ultra-rapid metabolism has been identified to be 
due to CYP2D6 gene amplification. 

Thus, the activity of T139 protein, expression of 
T13 9 nucleic acid, or mutation content of T13 9 genes in 
5 an individual can be determined to thereby select 
appropriate agent (s) for therapeutic or prophylactic 
treatment of the individual. In addition, 
pharmacogenetic studies can be used to apply genotyping 
of polymorphic alleles encoding drug -metabolizing enzymes 

10 to the identification of an individual's drug 

responsiveness phenotype. This knowledge, when applied 
to dosing or drug selection, can avoid adverse reactions 
or therapeutic failure and thus enhance therapeutic or 
prophylactic efficiency when treating a subject with a 

15 T139 modulator, such as a modulator identified by one of 
the exemplary screening assays described herein. 

4 . Monitoring of Effects During Clinical Trials 
Monitoring the influence of agents (e.g., drugs, 
compounds) on the expression or activity of T139 (e.g., 

20 the ability to modulate aberrant cell proliferation 

and/or differentiation) can be applied not only in basic 
drug screening, but also in clinical trials. For 
example, the effectiveness of an agent determined by a 
screening assay as described herein to increase T139 gene 

25 expression, protein levels, or upregulate T139 activity, 
can be monitored in clinical trails of subjects 
exhibiting decreased T139 gene expression, protein 
levels, or downregulated T139 activity. Alternatively, 
the effectiveness of an agent determined by a screening 

30 assay to decrease T139 gene expression, protein levels, . 
or downregulated T139 activity, can be monitored in 
clinical trails of subjects exhibiting increased T13 9 
gene expression, protein levels, or upregulated T139 
activity. In such clinical trials, the expression or 



WO 99/54343 PCT/US99/08896 

- 88 - 

activity of T139 and, preferably, other genes that have 
been implicated in, for example, a cellular proliferation 
disorder can be used as a "read out" or markers of the 
immune responsiveness of a particular cell. 
5 For example, and not by way of limitation, genes, 

including T139, that are modulated in cells by treatment 
with an agent (e.g., compound, drug or small molecule) 
which modulates T139 activity (e.g., identified in a 
screening assay as described herein) can be identified. 

10 Thus, to study the effect of agents on cellular 

proliferation disorders, for examp-ley in a clinical 
trial, cells can be isolated and RNA prepared and 
analyzed for the levels of expression of T139 and other 
genes implicated in the disorder. The levels of gene 

15 expression (i.e., a gene expression pattern) can be 
quantified by Northern blot analysis or RT-PCR, as 
described herein, or alternatively by measuring the 
amount of protein produced, by one of the methods as 
described herein, or by measuring the levels of activity 

20 of T139 or other genes. In this way, the gene expression 
pattern can serve as a marker, indicative of the 
physiological response of the cells to the agent . 
Accordingly, this response state may be determined 
before, and at various points during, treatment of the 

25 individual with the agent. 

In a preferred embodiment, the present invention 
provides a method for monitoring the effectiveness of 
treatment of a subject with an agent (e.g., an agonist, 
antagonist, peptidomimetic, protein, peptide, nucleic 

30 acid, small molecule, or other drug candidate identified 
by the screening assays described herein) comprising the 
steps of (i) obtaining a pre-administration sample from a 
subject prior to administration of the agent; (ii) 
detecting the level of expression of a T13 9 protein, 

35 mRNA, or genomic DNA in the preadministration sample; 
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(iii) obtaining one or more post -administration samples 
from the subject; (iv) detecting the level of expression 
or activity of the T139 protein, mRNA, or genomic DNA in 
the post-administration samples; (v) comparing the level 
5 of expression or activity of the T139 protein, mRNA, or 
genomic DNA in the pre-administration sample with the 
T139 protein, mRNA, or genomic DNA in the post 
administration sample or samples; and (vi) altering the 
administration of the agent to the subject accordingly. 

10 For example, increased administration of the agent may be 
desirable to increase the expression or activity of T139 
to higher levels than detected, i.e., to increase the 
effectiveness of the agent. Alternatively, decreased 
administration of the agent may be desirable to decrease 

15 expression or activity of T139 to lower levels than 
detected, i.e., to decrease the effectiveness of the 
agent . 

C. Methods of Treatment 
The present invention provides for both 
20 prophylactic and therapeutic methods of treating a 
subject at risk of (or susceptible to) a disorder or 
having a disorder associated with aberrant T13 9 
expression or activity. Such disorders include kidney 
defects such as kidney failure or hyperplasia. 

25 1 . Prophylactic Methods 

In one aspect, the invention provides a method for 
preventing in a subject, a disease or condition 
associated with an aberrant T139 expression or activity, 
by administering to the subject an agent which modulates 

30 T139 expression or at least one T139 activity. Subjects 
at risk for a disease which is caused or contributed to 
by aberrant T13 9 expression or activity can be identified 
by, for example, any or a combination of diagnostic or 
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prognostic assays as described herein. Administration of 
a prophylactic agent can occur prior to the manifestation 
of symptoms characteristic of the T139 aberrancy, such 
that a disease or disorder is prevented or, 
5 alternatively, delayed in its progression. Depending on 
the type of T139 aberrancy, for example, a T139 agonist 
or T139 antagonist agent can be used for treating the 
subject. The appropriate agent can be determined based 
on screening assays described herein. 

10 2. Therapeutic Methods ^ 

Another aspect of the invention pertains to 
methods of modulating T139 expression or activity for 
therapeutic purposes. The modulatory method of the 
invention involves contacting a cell with an agent that 

15 modulates one or more of the activities of T139 protein 
activity associated with the cell. An agent that 
modulates T139 protein activity can be an agent as 
described herein, such as a nucleic acid or a protein, a 
naturally-occurring cognate ligand of a T139 protein, a 

20 peptide, a T139 peptidomimetic, or other small molecule. 
In one embodiment, the agent stimulates one or more of 
the biological activities of T13 9 protein. Examples of 
such stimulatory agents include active T139 protein and a 
nucleic acid molecule encoding T139 that has been 

25 introduced into the cell. In another embodiment, the 
agent inhibits one or more of the biological activities 
of T139 protein. Examples of such inhibitory agents 
include antisense T139 nucleic acid molecules and anti- 
T13 9 antibodies. These modulatory methods can be 

30 performed in vitro (e.g., by culturing the cell with the 
agent) or, alternatively, in vivo (e.g, by administering 
the agent to a subject) . As such, the present invention 
provides methods of treating an individual afflicted with 
a disease or disorder characterized by aberrant 
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expression or activity of a T13 9 protein or nucleic acid 
molecule. In one embodiment, the method involves 
administering an agent (e.g., an agent identified by a 
screening assay described herein) , or combination of 
5 agents that modulates (e.g., upregulates or 

downregulates) T139 expression or activity. In another 
embodiment, the method involves administering a T139 
protein or nucleic acid molecule as therapy to compensate 
for reduced or aberrant T139 expression or activity. 

10 Stimulation of T13 9 activity is desirable in 

situations in which T139 is abnormally downregulated 
and/or in which increased T13 9 activity is likely to have 
a beneficial effect. Conversely, inhibition of T139 
activity is desirable in situations in which T139 is 

15 abnormally upregulated and/or in which decreased T139 
activity is likely to have a beneficial effect. 

This invention is further illustrated by the 
following examples which should not be construed as 
limiting. The contents of all references, patents and 

20 published patent applications cited throughout this 
application are hereby incorporated by reference. 

EXAMPLES 

Example 1 : Isolation and Characterization of Human T139 
cDNA 

25 RNA was isolated from human fetal kidney tissue, 

and the polyA+ fraction was purified using Oligotex beads 
(Qiagen) . Three micrograms of polyA+ RNA were used to 
synthesize a cDNA library using the Superscript cDNA 
Synthesis kit (Gibco BRL; Gaithersburg, MD) . 

30 Complementary DNA was directionally cloned into the 

expression plasmid pMET7 using the Sail and Not I sites in 
the polylinker to construct a plasmid library. 
Transformants were picked and grown for single-pass 
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sequencing. One cDNA clone ( j thKall5e09) was identified 
that encoded a protein with homology to testis-specif ic 
protein- 1 (TPX-1) , an acrosomal sperm protein that is a 
member of the SCP-like family of cysteine-rich secreted 
5 proteins. JthKall5e09 contains an open reading frame of 
446 amino acids, which is referred to as "Tango 139". 

Example 2 : Distribution of T139 mRNA in Human Tissues 

The expression of T139 was analyzed using Northern 
blot hybridization. Oligonucleotide primers (5 ' 

10 CCATGCTGCATCCAGAG 3 # (SEQ ID-NO :-14) ; 5' 

CACAGACAAAGGCTTCTATC 3' (SEQ ID NO: 15)) were used to 
amplify a 543 bp fragment from the coding region of 
jthKall4e09, and the DNA was radioactively labeled with 
32 P-dCTP using a Prime- It kit (Stratagene, La Jolla, CA) 

15 according to the supplier's instructions. Filters 

containing human mRNA (MTNI and MTNII from Clontech, Palo 
Alto, CA) were probed in ExpressHyb hybridization 
solution (Clontech) and washed at high stringency 
according to manufacturer's recommendations. 

20 Tango 139 is expressed at high levels as a 

transcript of about 2.0 kb in the kidney, with lower 
levels in the testis. In addition, there are additional 
transcripts in both kidney and testis at about 2.4 and 
3.5 kb. No other tissues examined (heart, brain, 

25 placenta, lung, liver, skeletal muscle, pancreas, spleen, 
thymus, ovaries, small intestine, colon and peripheral 
blood leukocytes) showed any expression. 

Example 3: Characterization of T13 9 Proteins 

In this example, the predicted amino acid sequence 
30 of human T139 protein was compared to amino acid 
sequences of known proteins and various motifs were 
identified. In addition, the molecular weight of the 
human T13 9 proteins was predicted. 
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The human T139 cDNA isolated as described above 
(Figure 1; SEQ ID N0:1) encodes a 446 amino acid protein 
(Figure 1; SEQ ID NO: 2) . The signal peptide prediction 
program SIGNALP (Nielsen et al. (1997) Protein 
5 Engineering 10:1-6) predicted that T139 includes a 26 
amino acid signal peptide (amino acid 1 to about amino 
acid 26 of SEQ ID NO: 2) preceding the 42 0 amino acid 
mature protein (about amino acid 27 to amino acid 446; 
SEQ ID NO: 4) . A hydropathy plot of T139 is presented in 

10 Figure 3. This plot shows the location of cysteines 
("cys"; short vertical lines-just below plot) and the 
PFAM identifiers (PF00188, PF00008, and PF00059; bars 
just above plot) . For general information regarding PFAM 
identifiers refer to Sonnhammer et al . (1997) Protein 

15 28:405-420 and http://www.psc.edu/general/software/ 
packages/pf am/pf am. html . 

As shown in Figure 2A, T139 has a region of 
homology (amino acids 47 to 190, of SEQ ID N0:2; SEQ ID 
NO: 5) to a SCP-like domain consensus sequence (PF00188, 

20 of SEQ ID NO: 2; SEQ ID NO: 9). Figure 2B shows the region 
of homology (amino acids 297 to 412, SEQ ID NO:2; SEQ ID 
NO: 6) to the C-type lectin domain consensus sequence 
(PF00059, SEQ ID NO: 10) . Although significant homology 
was observed, the four cysteines in this region of T139 

25 do not match the four conserved cysteines in the 

consensus sequence; the alignment only recognized three 
of the four cysteines in this region of T13 9 as identical 
to the consensus cysteines. Figure 2C shows the regions 
of homology (amino acids 232 to 260 of SEQ ID NO: 2; 

30 (EGF1) and 264 to 291 of SEQ ID NO:2; (EGF2) ; SEQ ID 

N0S:7 and 8, respectively) to a EGF-like domain consensus 
sequence (PF00008, SEQ ID N0:11). Although both the EGF1 
and EGF2 domains contain six cysteines as does the 
consensus, the alignment only recognizes five cysteines 

35 as matching the consensus. Mature T139 has a predicted 
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MW of 49 kDa (47 kDa with the signal peptide removed) , 
not including post-translational modifications. A signal 
peptide is predicted to exist from amino acids 1 to 26, 
using the prediction program SIGNALP (Nielsen et al . 
5 (1997) Protein Engineering 10:1-6). 

As noted above, T139 is homologous to testis- 
specific protein- 1 (TPX-1; Kasahara et al. (1989) 
Genomics 5:527-34). TPX-1 is a testis-specif ic protein 
that is expressed by spermatogenic cells and is thought 

10 to cause spermatogenic cells to bind to somatic Sertoli 
cells. Sertoli cells are essential for the 
proliferation, differentiation, and survival of 
spermatogenic cells. Human TPX-1 is a member of the 
CRISP family. Members of this family have a cysteine - 

15 rich C- terminal region having the pattern CX 8 CX 8 CX 8 CX 8 C 
(Foster and Gerton (1996) Mol* Reprod. Dev. 44:221-29; 
Kratzch et al . (1996) Eur, J. Biochem. 236:827-36). 
Another member of this family is acidic epididymal 
glycoprotein (AEG) . This protein is present on the 

20 surface of the plasma membrane at the sperm head and is 
thought to be involved in sperm-egg fusion (Brooks and 
Tiver (1983) J. Reprod. Fertil. 69:650-57). 

Based on the homology between T139 and TPX-1 and 
the expression of T139 in testis, T139 may play a role in 

25 spermatogenesis and other testicular functions. 

Accordingly, T139 nucleic acids, polypeptides, and 
modulators of T139 expression or activity can be used to 
modulate spermatogenesis or sperm-egg fusion. In 
addition, such molecules may be useful for treatment of a 

30 variety of testicular disorders, e.g., testicular cancer. 

Example 4 : Preparation of T139 Proteins 

Recombinant T129 can be produced in a variety of 
expression systems. For example, the mature T129 peptide 
can be expressed as a recombinant glutathione-S- 
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transferase (GST) fusion protein in E. coli and the 
fusion protein can be isolated and characterized. 
Specifically, as described above, T129 can be fused to 
GST and this fusion protein can be expressed in E. coli 
5 strain PEB199. Expression of the GST-T129 fusion protein 
in PEB199 can be induced with IPTG. The recombinant 
fusion protein can be purified from crude bacterial 
lysates of the induced PEB199 strain by affinity 
chromatography on glutathione beads. 

10 Equivalents - 

Those skilled in the art will recognize, or be 
able to ascertain using no more than routine 
experimentation, many equivalents to the specific 
embodiments of the invention described herein. Such 

15 equivalents are intended to be encompassed by the 
following claims. 
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What is claimed is: 

1. An isolated nucleic acid molecule selected 
from the group consisting of: 

a) a nucleic acid molecule comprising a 

5 nucleotide sequence which is at least 55% identical to 
the nucleotide sequence of SEQ ID N0:1 or SEQ ID NO: 3, 
the cDNA insert of the plasmid deposited with ATCC as 
Accession Number 98694, or a complement thereof ; 

b) a nucleic acid molecule comprising a fragment 
10 of at least 250 nucleotides of the nucleotide sequence of 

SEQ ID NO:l or SEQ ID NO : 3<y* the cDNA insert of the 
plasmid deposited with ATCC as Accession Number 98694, or 
a complement thereof; 

c) nucleic acid molecule which encodes a 

15 polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or SEQ ID NO: 4 or an amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with ATCC as 
Accession Number 98694; 

d) a nucleic acid molecule which encodes a 
20 fragment of a polypeptide comprising the amino acid 

sequence of SEQ ID NO: 2 or SEQ ID N0:4, wherein the 
fragment comprises at least 15 contiguous amino acids of 
SEQ ID NO: 2 or SEQ ID NO: 4 or the polypeptide encoded by 
the cDNA insert of the plasmid deposited with ATCC as 
25 Accession Number 98694; and 

e) a nucleic acid molecule which encodes a 
naturally occurring allelic variant of a polypeptide 
comprising the amino acid sequence of SEQ ID NO: 2 or SEQ 
ID NO: 4 or an amino acid sequence encoded by the cDNA 

30 insert of the plasmid deposited with ATCC as Accession 
Number 98694, wherein the nucleic acid molecule 
hybridizes to a nucleic acid molecule comprising SEQ ID 
NO:l or SEQ ID NO: 3 under stringent conditions. 
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2 . The isolated nucleic acid molecule of claim 
1, which is selected from the group consisting of: 

a) a nucleic acid comprising the nucleotide 
sequence of SEQ ID N0:1 or SEQ ID NO: 3, or the cDNA 

5 insert of the plasmid deposited with ATCC as Accession 
Number 98694, or a complement thereof; and 

b) a nucleic acid molecule which encodes a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or SEQ ID NO: 4 or an amino acid sequence encoded by 

10 the cDNA insert of the plasmid deposited with ATCC as 
Accession Number 98694. 

3 . The nucleic acid molecule of claim 1 further 
comprising vector nucleic acid sequences. 

4 . The nucleic acid molecule of claim 1 further 
15 comprising nucleic acid sequences encoding a heterologous 

polypept ide . 

5 . A host cell which contains the nucleic acid 
molecule of claim 1. 

6. The host cell of claim 4 which is a mammalian 
20 host cell. 

7. A non-human mammalian host cell containing 
the nucleic acid molecule of claim 1 . 

8. An isolated polypeptide selected from the 
group consisting of: 

25 a) a fragment of a polypeptide comprising the 

amino acid sequence of SEQ ID NO: 2 or SEQ ID NO: 4, 
wherein the fragment comprises at least 15 contiguous 
amino acids of SEQ ID NO: 2 or SEQ ID NO: 4; 
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b) a naturally occurring allelic variant of a 
polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or SEQ ID NO: 4 or an amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with ATCC as 

5 Accession Number 98694, wherein the polypeptide is 

encoded by a nucleic acid molecule which hybridizes to a 
nucleic acid molecule comprising SEQ ID N0:1 or SEQ ID 
NO: 3 under stringent conditions ; 

c) a polypeptide which is encoded by a nucleic 
10 acid molecule comprising a nucleotide sequence which is 

at least 55% identical to a nucleic acid comprising the 
nucleotide sequence of SEQ ID NO:l or SEQ ID NO: 3. 

9. The isolated polypeptide of claim 8 
comprising the amino acid sequence of SEQ ID NO: 2 or SEQ 

15 ID NO: 4 or an amino acid sequence encoded by the cDNA 
insert of the plasmid deposited with ATCC as Accession 
Number 98694. 

10. The polypeptide of claim 8 further comprising 
heterologous amino acid sequences. 

20 11. An antibody which selectively binds to a 

polypeptide of claim 8. 

12 . A method for producing a polypeptide selected 
from the group consisting of: 

a) a polypeptide comprising the amino acid 

25 sequence of SEQ ID NO: 2 or SEQ ID NO: 4 or an amino acid 
sequence encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number 98694; 

b) a fragment of a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 2 or SEQ ID NO: 4 or an 

3 0 amino acid sequence encoded by the cDNA insert of the 
plasmid deposited with ATCC as Accession Number 98694, 
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wherein the fragment comprises at least 15 contiguous 
amino acids of SEQ ID NO: 2 or SEQ ID NO: 4 or an amino 
acid sequence encoded by the cDNA insert of the plasmid 
deposited with ATCC as Accession Number 98694; and 
5 c) a naturally occurring allelic variant of a 

polypeptide comprising the amino acid sequence of SEQ ID 
NO: 2 or SEQ ID NO: 4 or an amino acid sequence encoded by 
the cDNA insert of the plasmid deposited with ATCC as 
Accession Number 98694, wherein the polypeptide is 

10 encoded by a nucleic acid molecule which hybridizes to a 
nucleic acid molecule comprising SEQ ID NO:l or SEQ ID 
NO: 3 under stringent conditions; 

comprising culturing the host cell of claim 5 
under conditions in which the nucleic acid molecule is 

15 expressed. 

13 . The isolated polypeptide of claim 8 
comprising the amino acid sequence of SEQ ID NO: 2 or SEQ 
ID NO: 4 or an amino acid sequence encoded by the cDNA 
insert of the plasmid deposited with ATCC as Accession 

20 Number 98694. 

14 . A method for detecting the presence of a 
polypeptide of claim 8 in a sample, comprising: 

a) contacting the sample with a compound which 
selectively binds to a polypeptide of claim 8; and 
25 b) determining whether the compound binds to the 

polypeptide in the sample. 

15. The method of claim 14, wherein the compound 
which binds to the polypeptide is an antibody. 



16. A kit comprising a compound which selectively 
30 binds to a polypeptide of claim 8 and instructions for 
use . 
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17. A method for detecting the presence of a 
nucleic acid molecule of claim 1 in a sample, comprising 
the steps of: 

a) contacting the sample with a nucleic acid 
5 probe or primer which selectively hybridizes to the 

nucleic acid molecule; and 

b) determining whether the nucleic acid probe or 
primer binds to a nucleic acid molecule in the sample. 

18. The method of claim 17, wherein the sample 
10 comprises mRNA molecules and is contacted with a nucleic 

acid probe. 

19. A kit comprising a compound which selectively 
hybridizes to a nucleic acid molecule of claim 1 and 
instructions for use. 



15 20. A method for identifying a compound which 

binds to a polypeptide of claim 8 comprising the steps 
of: 

a) contacting a polypeptide, or a cell 
expressing a polypeptide of claim 8 with a test compound; 

20 and 

b) determining whether the polypeptide binds to 
the test compound. 

21. The method of claim 20, wherein the binding 
of the test compound to the polypeptide is detected by a 
25 method selected from the group consisting of: 

a) detection of binding by direct detecting of 
test compound/polypeptide binding; 

b) detection of binding using a competition 
binding assay; 

30 c) detection of binding using an assay for EGF 

or calcium-dependent carbohydrate recognition activity. 
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22. A method for modulating the activity of a 
polypeptide of claim 8 comprising contacting a 
polypeptide or a cell expressing a polypeptide of claim 8 
with a compound which binds to the polypeptide in a 

5 sufficient concentration to modulate the activity of the 
polypeptide. 

23. A method for identifying a compound which 
modulates the activity of a polypeptide of claim 8, 
comprising: 

10 a) contacting a- polypeptide of claim 8 with a 

test compound; and 

b) determining the effect of the test compound 
on the activity of the polypeptide to thereby identify a 
compound which modulates the activity of the polypeptide. 
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GTCGACCCACGCGTCCGGCTCCATCC^GCCTGAGAAACAAGCCGGGTGGCTGAGCCAGG^ .GTGCACGGAGTGCCTGAC 7 9 

GGGCCCAACAGACCC ATG CTG CAT CCA GAG ACC TCC CCT GGC CGG GGG CAT CTC CTG GCT GTG 142 
Met Leu His Pro Glu Thr Ser Pro Gly Arg Gly His Leu Leu Ala Val 
15 10 15 

CTC CTG GCC CTC CTT GGC ACC GCC TGG GCA GAG GTG TGG CCA CCC CAG CTG CAG GAG CAG 202 
Leu Leu Ala Leu Leu Gly Thr Ala Trp Ala Glu Val Trp Pro Pro Gin Leu Gin Glu Gin 
20 25 30 35 

GCT CCG ATG GCC GGA GCC CTG AAC AGG AAG GAG AGT TTC TTG CTC CTC TCC CTG CAC AAC 262 
Ala Pro Met Ala Gly Ala Leu Asn Arg Lys Glu Ser Phe Leu Leu Leu Ser Leu His Asn 
40 45 50 55 

CGC CTG CGC AGC TGG GTC CAG CCC CCT GCG GCT GAC ATG CGG AGG CTG GAC TGG AGT GAC 322 
Ara Leu Arg Ser Trp Val Gin Pro Pro Ala Ala Asp Met Arg Arg Leu Asp Trp Ser Asp 
* 60 65 70 75 

AGC CTG GCC CAA CTG GCT CAA GCC AGG GCA GCC CTC TGT GGA ATC CCA ACC CCG AGC CTG 382 
Ser Leu Ala Gin Leu Ala Gin Ala Arg Ala Ala Leu Cys Gly lie Pro Thr Pro Ser Leu 
80 '85 90 95 

GCG TCC GGC CTG TGG CGC ACC CTG CAA GTG GGC TGG AAC ATG CAG CTG CTG CCC GCG GGC 442 
Ala Ser Gly Leu Trp Arg Thr Leu Gin Val Gly Trp Asn Met Gin Leu Leu Pro Ala Gly 
100 105 HO 115 

TTG GCG TCC TTT GTT GAA GTG GTC AGC CTA TGG TTT GCA GAG GGG CAG CGG TAC AGC CAC 502 
Leu Ala Ser Phe Val Glu Val Val Ser Leu Trp Phe Ala Glu Gly Gin Arg Tyr Ser His 
120 125 130 135 

GCG GCA GGA GAG TGT GCT CGC AAC GCC ACC TGC ACC CAC TAC ACG CAG CTC GTG TGG GCC 562 
Ala Ala Gly Glu Cys Ala Arg Asn Ala Thr Cys Thr His Tyr Thr Gin Leu Val Trp Ala 
140 145 150 155 

ACC TCA AGC CAG CTG GGC TGT GGG CGG CAC CTG TGC TCT GCA GGC CAG GCA GCG ATA GAA 622 
Thr Ser Ser Gin Leu Gly Cys Gly Arg His Leu Cys Ser Ala Gly Gin Ala Ala He Glu 
160 165 170 175 

GCC TTT GTC TGT GCC TAC TCC CCC GGA GGC AAC TGG GAG GTC AAC GGG AAG ACA ATC ATC 682 
Ala Phe Val Cys Ala Tyr Ser Pro Gly Gly Asn Trp Glu Val Asn Gly Lys Thr He He 
180 185 190 195 

CCC TAT AAG AAG GGT GCC TGG TGT TCG CTC TGC ACA GCC AGT GTC TCA GGC TGC TTC AAA 742 
Pro Tvr Lvs Lys Gly Ala Trp Cys Ser Leu Cys Thr Ala Ser Val Ser Gly Cys Phe Lys 
* 200 205 210 215 

GCC TGG GAC CAT GCA GGG GGG CTC TGT GAG GTC CCC AGG AAT CCT TGT CGC ATG AGC TGC 802 
Ala Trp Asp His Ala Gly Gly Leu Cys Glu Val Pro Arg Asn Pro Cys Arg Met Ser Cys 
220 225 230 235 

CAG AAC CAT GGA CGT CTC AAC ATC AGC ACC TGC CAC TGC CAC TGT CCC CCT GGC TAC ACG 862 
Gin Asn His Gly Arg Leu Asn He Ser Thr Cys His Cys His Cys Pro Pro Gly Tyr Thr 
240 245 250 255 

GGC AGA TAC TGC CAA GTG AGG TGC AGC CTG CAG TGT GTG CAC GGC CGG TTC CGG GAG GAG 922 
Gly Arg Tyr Cys Gin Val Arg Cys Ser Leu Gin Cys Val His Gly Arg Phe Arg Glu Glu 
260 265 270 275 

GAG TGC TCG TGC GTC TGT GAC ATC GGC TAC GGG GGA GCC CAG TGT GCC ACC AAG GTG CAT 982 
Glu Cys Ser Cys Val Cys Asp He Gly Tyr Gly Gly Ala Gin Cys Ala Thr Lys Val His 
280 285 290 295 

TTT CCC TTC CAC ACC TGT GAC CTG AGG ATC GAC GGA GAC TGC TTC ATG GTG TCT TCA GAG 1042 
Phe Pro Phe His Thr Cys Asp Leu Arg He Asp Gly Asp Cys Phe Met Val Ser Ser Glu 
300 305 310 315 

GCA GAC ACC TAT TAC AGA GCC AGG ATG AAA TGT CAG AGG AAA GGC GGG GTG CTG GCC CAG 1102 
Ala Asp Thr Tyr Tvr Arg Ala Arg Met Lys Cys Gin Arg Lys Gly Gly Val Leu Ala Gin 
320 " 325 330 335 



FIG. 1 
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ATC AAG AGC CAG AAA GTG CAG GAC ATC CTC GCC TTC TAT CTG GGC CGC CTG GAG ACC ACC 1162 
He Lys Ser Gin Lys Val Gin Asp He Leu Ala Phe Tyr Leu Gly Arg Leu Glu Thr Thr 
340 345 350 355 

AAC GAG GTG ATT GAC AGT GAC TTC GAG ACC AGG AAC TTC TGG ATC GGG CTC ACC TAC AAG 1222 
Asn Glu Val He Asp Ser Asp Phe Glu Thr Arg Asn Phe Trp He Gly Leu Thr Tyr Lys 
360 365 370 375 

ACC GCC AAG GAC TCC TTC CGC TGG GCC ACA GGG GAG CAC CAG GCC TTC ACC AGT TTT GCC 1282 
Thr Ala Lvs Asp Ser Phe Arg Trp Ala Thr Gly Glu His Gin Ala Phe Thr Ser Phe Ala 
* 380 385 390 395 

TTT GGG CAG CCT GAC AAC CAC GGG TTT GGC AAC TGC GTG GAG CTG CAG GCT TCA GCT GCC 1342 
Phe Gly Gin Pro Asp Asn His Gly Phe Gly Asn Cys Val Glu Leu Gin Ala Ser Ala Ala 
400 405 410 415 

TTC AAC TGG AAC AAC CAG CGC TGC AAA ACC CGA AAC CGT TAC ATC TGC CAG TTT GCC CAG 1402 
Phe Asn Trp Asn Asn Gin Arg Cys Lys Thr Arg Asn Arg Tyr He Cys Gin Phe Ala Gin 
420 425 430 435 

GAG CAC ATC TCC CGG TGG GGC CCA GGG TCC TGA 1435 
Glu His He Ser Arg Trp Gly Pro Gly Ser * 
440 445 

GGCCTGACCACATGGCTCCCTCGCCTGCCCTGGGAGCACCGGCTCTGCTTACCTGTCCGC^ 1514 

GCCAGGTTAAGACCACATGCCTCATGTCCAAAGAGGTCTCAGACCTrGC 1593 

GGGAGGCCAGTGAGGGCCAGGGAGTGAGTGTTAGAAGAAGCT^ 1672 

TCAATTAGATGGCAAAGGAGAGGACACCGCCAGTGGTCCAA 1751 

GOVGCGGAGCTTCCCraTGGCATGAACCCCACAGGGT^ 1830 

AAAAAAAAAAAAAAAAGGGCGGCCGC 1B56 



FIG. 1 (cont.) 
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SEQUENCE LISTING 

(1> GENERAL INFORMATION 
(i) APPLICANT: Holtzman, Douglas 

(ii) TITLE OF THE INVENTION: NOVEL MOLECULES OF THE T13 9 -RELATED 

PROTEIN FAMILY AND USES THEREOF 

(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson P. C, 

(B) STREET: 225 Franklin Street 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA . 

(F) ZIP: 02110-2804 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows 95 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 23 -APR- 1998 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER : 

(B) FILING DATE: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Meiklejohn, Ph.D., Anita L. 

(B) REGISTRATION NUMBER: 35,283 

(C) REFERENCE /DOCKET NUMBER: 09404/047001 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617/542-5070 

(B) TELEFAX: 617/542-8906 

(C) TELEX: 200154 

(2) INFORMATION FOR SBQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1856 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/ KEY: Coding Sequence 

(B) LOCATION: 95 . . . 1432 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GTCGACCCAC GCGTCCGGCT CCATCCAGCC TGAGAAACAA GCCGGGTGGC TGAGCCAGGC 
TGTGCACGGA GTGCCTGACG GGCCCAACAG ACCC ATG CTG CAT CCA GAG ACC TCC 

Met Leu His Pro Glu Thr Ser 
1 5 



60 
115 
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CCT GGC CGG GGG CAT CTC CTG GCT GTG CTC CTG GCC CTC CTT GGC ACC 163 
Pro Gly Arg Gly His Leu Leu Ala Val Leu Leu Ala Leu Leu Gly Thr 
10 15 20 

GCC TGG GCA GAG GTG TGG CCA CCC CAG CTG CAG GAG CAG GCT CCG ATG 211 
Ala Trp Ala Glu Val Trp Pro Pro Gin Leu Gin Glu Gin Ala Pro Met 
25 30 35 

GCC GGA GCC CTG AAC AGG AAG GAG AGT TTC TTG CTC CTC TCC CTG CAC 259 
Ala Gly Ala Leu Asn Arg Lys Glu Ser Phe Leu Leu Leu Ser Leu His 
40 45 50 55 

AAC CGC CTG CGC AGC TGG GTC CAG CCC CCT GCG GCT GAC ATG CGG AGG 307 
Asn Arg Leu Arg Ser Trp Val Gin Pro Pro Ala Ala Asp Met Arg Arg 
60 65 70 

CTG GAC TGG AGT GAC AGC CTG GCC CAA CTG GCT CAA GCC AGG GCA GCC 355 
Leu Asp Trp Ser Asp Ser Leu Ala Gin Leu Ala Gin Ala Arg Ala Ala 
75 80 85 

CTC TGT GGA ATC CCA ACC CCG AGC CTG GCG TCC GGC CTG TGG CGC ACC 403 
Leu Cys Gly lie Pro Thr Pro Ser Leu Ala Ser Gly Leu Trp Arg Thr 
90 95 100 

CTG CAA GTG GGC TGG AAC ATG CAG CTG CTG CCC GCG GGC TTG GCG TCC 451 
Leu Gin Val Gly Trp Asn Met Gin Leu Leu Pro Ala Gly Leu Ala Ser 
105 110 115 

TTT GTT GAA GTG GTC AGC CTA TGG TTT GCA GAG GGG CAG CGG TAC AGC 499 
Phe Val Glu Val Val Ser Leu Trp Phe Ala Glu Gly Gin Arg Tyr Ser 
120 125 130 135 

CAC GCG GCA GGA GAG TGT GCT CGC AAC GCC ACC TGC ACC CAC TAC ACG 547 
His Ala Ala Gly Glu Cys Ala Arg Asn Ala Thr Cys Thr His Tyr Thr 
140 145 150 

CAG CTC GTG TGG GCC ACC TCA AGC CAG CTG GGC TGT GGG CGG CAC CTG 595 
Gin Leu Val Trp Ala Thr Ser Ser Gin Leu Gly Cys Gly Arg His Leu 
155 160 165 

TGC TCT GCA GGC CAG GCA GCG ATA GAA GCC TTT GTC TGT GCC TAC TCC 643 
Cys Ser Ala Gly Gin Ala Ala lie Glu Ala Phe Val Cys Ala Tyr Ser 
170 175 180 

CCC GGA GGC AAC TGG GAG GTC AAC GGG AAG ACA ATC ATC CCC TAT AAG 691 
Pro Gly Gly Asn Trp Glu Val Asn Gly Lys Thr lie lie Pro Tyr Lys 
185 190 195 

AAG GGT GCC TGG TGT TCG CTC TGC ACA GCC AGT GTC TCA GGC TGC TTC 739 
Lys Gly Ala Trp Cys Ser Leu Cys Thr Ala Ser Val Ser Gly Cys Phe 
200 205 210 215 

AAA GCC TGG GAC CAT GCA GGG GGG CTC TGT GAG GTC CCC AGG AAT CCT 787 
Lys Ala Trp Asp His Ala Gly Gly Leu Cys Glu Val Pro Arg Asn Pro 
220 225 230 

TGT CGC ATG AGC TGC CAG AAC CAT GGA CGT CTC AAC ATC AGC ACC TGC 835 
Cys Arg Met Ser Cys Gin Asn His Gly Arg Leu Asn lie Ser Thr Cys 
235 240 245 

CAC TGC CAC TGT CCC CCT GGC TAC ACG GGC AGA TAC TGC CAA GTG AGG 883 
His Cys His Cys Pro Pro Gly Tyr Thr Gly Arg Tyr Cys Gin Val Arg 
250 255 260 



V 
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TGC AGC CTG CAG TGT GTG CAC GGC CGG TTC CGG GAG GAG GAG TGC TCG 931 
Cys Ser Leu Gin Cys Val His Gly Arg Phe Arg Glu Glu Glu Cys Ser 
265 270 275 

TGC GTC TGT GAC ATC GGC TAC GGG GGA GCC CAG TGT GCC ACC AAG GTG 979 
Cys Val Cys Asp lie Gly Tyr Gly Gly Ala Gin Cys Ala Thr Lys Val 
280 285 290 295 

CAT TTT CCC TTC CAC "ACC TGT GAC CTG AGG ATC GAC GGA GAC TGC TTC 1027 
His Phe Pro Phe His Thr Cys Asp Leu Arg lie Asp Gly Asp Cys Phe 
300 305 310 

ATG GTG TCT TCA GAG GCA GAC ACC TAT TAC AGA GCC AGG ATG AAA TGT 1075 
Met Val Ser Ser Glu Ala Asp Thr Tyr Tyr Arg Ala Arg Met Lys Cys 
315 320 325 

CAG AGG AAA GGC GGG GTG CTG GCC CAG ATC AAG AGC CAG AAA GTG CAG 1123 
Gin Arg Lys Gly Gly Val Leu- Ala Gin lie Lys Ser Gin Lys Val Gin 
330 335 340 

GAC ATC CTC GCC TTC TAT CTG GGC CGC CTG GAG ACC ACC AAC GAG GTG 1171 
Asp lie Leu Ala Phe Tyr Leu Gly Arg Leu Glu Thr Thr Asn Glu Val 
345 350 355 

ATT GAC AGT GAC TTC GAG ACC AGG AAC TTC TGG ATC GGG CTC ACC TAC 1219 
He Asp Ser Asp Phe Glu Thr Arg Asn Phe Trp He Gly Leu Thr Tyr 
360 365 370 375 

AAG ACC GCC AAG GAC TCC TTC CGC TGG GCC ACA GGG GAG CAC CAG GCC 1267 
Lys Thr Ala Lys Asp Ser Phe Arg Trp Ala Thr Gly Glu His Gin Ala 
380 385 390 

TTC ACC AGT TTT GCC TTT GGG CAG CCT GAC AAC CAC GGG TTT GGC AAC 1315 
Phe Thr Ser Phe Ala Phe Gly Gin Pro Asp Asn His Gly Phe Gly Asn 
395 400 405 

TGC GTG GAG CTG CAG GCT TCA GCT GCC TTC AAC TGG AAC AAC CAG CGC 1363 
Cys Val Glu Leu Gin Ala Ser Ala Ala Phe Asn Trp Asn Asn Gin Arg 
410 415 420 

TGC AAA ACC CGA AAC CGT TAC ATC TGC CAG TTT GCC CAG GAG CAC ATC 1411 
Cys Lys Thr Arg Asn Arg Tyr He Cys Gin Phe Ala Gin Glu His He 
425 430 435 

TCC CGG TGG GGC CCA GGG TCC TGAGGCCTGA CCACATGGCT CCCTCGCCTG CCCT 1466 
Ser Arg Trp Gly Pro Gly Ser 
440 445 

GGGAGCACCG GCTCTGCTTA CCTGTCCGCC CACCTGTCTG GAACAAGGGC CAGGTTAAGA 1526 

CCACATGCCT CATGTCCAAA GAGGTCTCAG ACCTTGCACA ATGCCAGAAG TTGGGCAGAG 1586 

AGAGGCAGGG AGGCCAGTGA GGGCCAGGGA GTGAGTGTTA GAAGAAGCTG GGGCCCTTCG 1646 

CCTGCTTTTG ATTGGGAAGA TGGGCTTCAA TTAGATGGCA AAGGAGAGGA CACCGCCAGT 1706 

GGTCCAAAAA GGCTGCTCTC TTCCACCTGG CCCAGACCCT GTGGGGCAGC GGAGCTTCCC 1766 

TGTGGCATGA ACCCCACAGG GTATTAAATT ATGAATCAGC TGAAAAAAAA AAAAAAAAAA 1826 

AAAAAAAAAA AAAAAAAAAA GGGCGGCCGC 1856 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 





<xi) SEQUENCE 


DESCRIPTION 


: SEQ ID 


N0:2: 


Met 


Leu 


His 


Pro 


Glu 


Thr 


Ser 


Pro 


Gly 


Arg 


Gly 


His Leu Leu Ala Val 


1 








5 










10 




15 


Leu 


Leu 


Ala 


Leu 


Leu 


•Gly Thr 


Ala 


Trp 


Ala 


Glu 


Val Trp Pro Pro Gin 








20 










25 






30 


Leu 


Gin 


Glu 


Gin 


Ala 


Pro 


Met 


Ala 


Gly 


Ala 


Leu 


Asn Arg Lys Glu Ser 






35 










40 








45 


Phe 


Leu 


Leu 


Leu 


Ser 


Leu 


His 


Asn 


Arg 


Leu 


Arg 


Ser Trp Val Gin Pro 




50 










55 










60 


Pro 


Ala 


Ala 


Asp 


Met 


Arg Arg 


Leu 


Asp 


Trp 


Ser 


Asp Ser Leu Ala Gin 


65 










70 










75 


80 


Leu 


Ala 


Gin 


Ala 


Arg 


Ala Ala 


Leu 


Cys 


Gly 


He 


Pro Thr Pro Ser Leu 










85 










90 




95 


Ala 


Ser 


Gly 


Leu 


Trp 


Arg~Thr 


Leu 


Gin 


Val 


Gly 


Trp Asn Met Gin Leu 








100 










105 






110 


Leu 


Pro 


Ala 


Gly 


Leu 


Ala 


Ser 


Phe 


Val 


Glu 


Val 


Val Ser Leu Trp Phe 






115 










120 








125 


Ala 


Glu 


Gly 


Gin 


Arg 


Tyr 


bsr 


His 


Ala 


Ala 


Gly 


Glu Cys Ala Arg Asn 




130 










lie 
1J J 










140 




Thr 


Cys 


Thr 


His 


Tyr 


± 1.1 X. 


Gin 


Leu 


Val 


Trp 


Ala Thr Ser Ser Gin 


145 


















155 


160 


Leu 


Gly 


Cys 


Gly 


Arg 


His 


Leu 


Cys 


Ser 


Ala 


Gly 


Gin Ala Ala He Glu 










165 










170 




175 


Al a 


Phe 


Val 


Cys 


Ala 




Ser 


Pro 


Gly 


Gly 


Asn 


Trp Glu Val Asn Gly 








180 










185 






190 


Liys 


Thr 


lie 


lie 


Pro 


Tyr 


Lys 


Lys 


Gly 


Ala 


Trp 


Cys Ser Leu Cys Thr 






195 










200 








205 


Ala 


Ser 


Val 


Ser 


Gly 


Cys 


Phe 


Lys 


Ala 


Trp 


Asp 


His Ala Gly Gly Leu 




210 










215 










220 


Cys 


Glu 


Val 


Pro 


Arg 


Asn 


Pro 


Cys 


Arg 


Met 


Ser 


Cys Gin Asn His Gly 


225 










230 










235 


240 


Ara 


Leu 


Asn 


lie 


Ser 


Thr 


Cys 


His 


Cys 


His 


Cys 


Pro Pro Gly Tyr Thr 










245 










250 




255 


Gly 


Arg 


Tyr 


Cys 


Gin 


Val 


Arg 


Cys 


Ser 


Leu 


Gin 


Cys Val His Gly Arg 








260 










265 






270 


Phe 


Arg 


Glu 


Glu 


Glu 


Cys 


Ser 


Cys 


Val 


Cys 


Asp 


He Gly Tyr Gly Gly 






275 










280 








285 


Ala 


Gin 


Cys 


Ala 


Thr 


Lys 


Val 


His 


Phe 


Pro 


Phe 


His Thr Cys Asp Leu 




290 








295 










300 


Arg 


lie 


Asp 


Gly 


Asp 


Cys 


Phe 


Met 


Val 


Ser 


Ser 


Glu Ala Asp Thr Tyr 


305 










310 










315 


320 


Tyr 


Arg 


Ala 


Arg 


Met 


Lys 


Cys 


Gin 


Arg 


Lys 


Gly 


Gly Val Leu Ala Gin 










325 










330 




335 


lie 


Lys 


Ser 


Gin 


Lys 


Val 


Gin 


Asp 


He 


Leu 


Ala 


Phe Tyr Leu Gly Arg 








340 










345 






350 


Leu 


Glu 


Thr 


Thr 


Asn 


Glu 


Val 


He 


Asp 


Ser 


Asp 


Phe Glu Thr Arg Asn 






355 










360 








365 


Phe 


Trp 


lie 


Gly 


Leu 


Thr 


Tyr 


Lys 


Thr 


Ala 


Lys 


Asp Ser Phe Arg Trp 




370 










375 










380 


Ala 


Thr 


Gly 


Glu 


His 


Gin 


Ala 


Phe 


Thr 


Ser 


Phe 


Ala Phe Gly Gin Pro 


385 










390 










395 


400 


Asp 


Asn 


His 


Gly 


Phe 


Gly 


Asn 


Cys 


Val 


Glu 


Leu 


Gin Ala Ser Ala Ala 










405 










410 




415 


Phe 


Asn 


Trp 


Asn 


Asn 


Gin 


Arg 


Cys 


Lys 


Thr 


Arg 


Asn Arg Tyr He Cys 








420 










425 






430 


Gin 


Phe 


Ala 


Gin 


Glu 


His 


He 


Ser 


Arg 


Trp 


Gly 


Pro Gly Ser 






435 










440 








445 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1338 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATGCTGCATC CAGAGACCTC CCCTGGCCGG GGGCATCTCC TGGCTGTGCT CCTGGCCCTC 60 

CTTGGCACCG CCTGGGCAGA GGTGTGGCCA CCCCAGCTGC AGGAGCAGGC TCCGATGGCC 120 

GGAGCCCTGA ACAGGAAGGA GAGTTTCTTG CTCCTCTCCC TGCACAACCG CCTGCGCAGC 180 

TGGGTCCAGC CCCCTGCGGC TGACATGCGG AGGCTGGACT GGAGTGACAG CCTGGCCCAA 240 

CTGGCTCAAG CCAGGGCAGC CCTCTGTGGA ATCCCAACCC CGAGCCTGGC GTCCGG CCTG 300 

TGGCGCACCC TGCAAGTGGG CTGGAACATG CAGCTGCTGC CCGCGGGCTT GGCGTCCTTT 360 

GTTGAAGTGG TCAGCCTATG GTTTGCAGAG GGGCAGCGGT ACAGCCACGC GGCAGGAGAG 420 

TGTGCTCGCA ACGCCACCTG CACCCACTAC ACGCAGCTCG TGTGGGCCAC CTCAAGCCAG 480 

CTGGGCTGTG GGCGGCACCT GTGCTCTGCA GGCCAGGCAG CGATAGAAGC CTTTGTCTGT 540 

GCCTACTCCC CCGGAGGCAA CTGGGAGGTC AACGGGAAGA CAATCATCCC CTATAAGAAG 600 

GGTGCCTGGT GTTCGCTCTG CACAGCCAGT GTCTCAGGCT GCTTCAAAGC CTGGGACCAT 660 

GCAGGGGGGC TCTGTGAGGT CCCCAGGAAT CCTTGTCGCA TGAGCTGCCA GAACCATGGA 720 

CGTCTCAACA TCAGCACCTG CCACTGCCAC TGTCCCCCTG GCTACACGGG CAGATACTGC 780 

CAAGTGAGGT GCAGCCTGCA GTGTGTGCAC GGCCGGTTCC GGGAGGAGGA GTGCTCGTGC 840 

GTCTGTGACA TCGGCTACGG GGGAGCCCAG TGTGCCACCA AGGTGCATTT TCCCTTCCAC 900 

ACCTGTGACC TGAGGATCGA CGGAGACTGC TTCATGGTGT CTTCAGAGGC AGACACCTAT 960 

TACAGAGCCA GGATGAAATG TCAGAGGAAA GGCGGGGTGC TGGCCCAGAT CAAGAGCCAG 1020 

AAAGTGCAGG ACATCCTCGC CTTCTATCTG GGCCGCCTGG AGACCACCAA CGAGGTGATT 1080 

GACAGTGACT TCGAGACCAG GAACTTCTGG ATCGGGCTCA CCTACAAGAC CGCCAAGGAC 1140 

TCCTTCCGCT GGGCCACAGG GGAGCACCAG GCCTTCACCA GTTTTGCCTT TGGGCAGCCT 1200 

GACAACCACG GGTTTGGCAA CTGCGTGGAG CTGCAGGCTT CAGCTGCCTT CAACTGGAAC 1260 

AACCAGCGCT GCAAAACCCG AAACCGTTAC ATCTGCCAGT TTGCCCAGGA GCACATCTCC 1320 

CGGTGGGGCC CAGGGTCC 1338 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 4: 



Glu 


Val 


Trp 


Pro 


Pro 


Gin 


Leu 


Gin 


Glu 


Gin 


Ala 


Pro Met Ala Gly Ala 


1 






5 










10 




15 


Leu 


Asn 


Arg 


Lys 


Glu 


Ser 


Phe 


Leu 


Leu 


Leu 


Ser 


Leu His Asn Arg Leu 






20 










25 






30 


Arg 


Ser 


Trp 


Val 


Gin 


Pro 


Pro 


Ala 


Ala 


Asp 


Met 


Arg Arg Leu Asp Trp 




35 










40 








45 


Ser 


Asp 
50 


Ser 


Leu 


Ala 


Gin 


Leu 
55 


Ala 


Gin 


Ala 


Arg 


Ala Ala Leu Cys Gly 
60 


He 


Pro 


Thr 


Pro 


Ser 


Leu 


Ala 


Ser 


Gly 


Leu 


Trp 


Arg Thr Leu Gin Val 


65 










70 










75 


80 


Gly 


Trp 


Asn 


Met 


Gin 


Leu 


Leu 


Pro 


Ala 


Gly 


Leu 


Ala Ser Phe Val Glu 






85 










90 




95 


Val 


Val 


Ser 


Leu 
100 


Trp 


Phe 


Ala 


Glu 


Gly 
105 


Gin 


Arg 


Tyr Ser His Ala Ala 
110 


Gly 


Glu 


Cys 


Ala 


Arg 


Asn 


Ala 


Thr 


Cys 


Thr 


His 


Tyr Thr Gin Leu Val 



115 120 125 



WO 99/54343 



6 



PCT/US99/08896 



Trp 


Ala 


Thr 


Ser 




130 






Gly 


Gin 


Ala 


Ala 


145 








Asn 


Trp 


Glu 


Val 


Trp 


Cys 


Ser 


Leu 








180 


Asp 


His 


Ala 


Gly 






195 




Ser 


Cys 


Gin 


Asn 










Cys 


Pro 


Pro 


Gly 


225 








Gin 


Cys 


Val 


His 


Asp 


lie 


Gly 


Tyr 








260 


Phe 


His 


Thr 


Cys 






275 




Ser 


Glu 


Ala 


Asp 




290 






Gly 


Gly 


Val 


Leu 


305 








Ala 


Phe 


Tyr 


Leu 


Asp 


Phe 


Glu 


Thr 






340 


Lys 


Asp 


Ser 


Phe 






355 




Phe 


Ala 


Phe 


Gly 




370 






Leu 


Gin 


Ala 


Ser 


385 








Arg 


Asn 


Arg 


Tyr 


Gly 


Pro 


Gly 


Ser 








420 



Ser Gin 

He Glu 
150 
Asn Gly 
165 

Cys Thr 

Gly Leu 

His Gly 

Tyr Thr 
230 
Gly Arg 
245 

Gly Gly 

Asp Leu 

Thr Tyr 

Ala Gin 
310 
Gly Arg 
325 

Arg Asn 

Arg Trp 

Gin Pro 

Ala Ala 
390 
He Cys 
405 



Leu Gly 
135 

Ala Phe 

Lys Thr 

Ala Ser 

Cys Glu 
200 
Arg Leu 
215 

Gly Arg 

Phe Arg 

Ala Gin 

Arg He 
280 
Tyr Arg 
295 

He Lys 

Leu Glu 

Phe Trp 

Ala Thr 
360 
Asp Asn 
375 

Phe Asn 
Gin Phe 



Cys Gly 
Val Cys 

He He 
170 
Val Ser 
185 

Val Pro 

Asn He 

Tyr Cys 

Glu Glu 
250 
Cys Ala 
265 

Asp Gly 

Ala Arg 

Ser Gin 

Thr Thr 
330 
He Gly 
345 

Gly Glu 

His Gly 

Trp Asn 

Ala Gin 
410 



Arg His Leu 
140 

Ala Tyr Ser 
155 

Pro Tyr Lys 

Gly. Cys Phe 

Arg Asn Pro 
205 

Ser Thr Cys 

220 
Gin Val Arg 
235 

Glu Cys Ser 

Thr Lys Val 

Asp Cys Phe 
285 

Met Lys Cys 

300 
Lys Val Gin 
315 

Asn Glu Val 

Leu Thr Tyr 

His Gin Ala 
365 

Phe Gly Asn 

380 
Asn Gin Arg 
395 

Glu His He 



Cys Ser Ala 

Pro Gly Gly 
160 

Lys Gly Ala 

175 
Lys Ala Trp 
190 

Cys Arg Met 

His Cys His 

Cys Ser Leu 
240 

Cys Val Cys 

255 
His Phe Pre 
270 

Met Val Ser 

Gin Arg Lys 

Asp He Leu 
320 

lie Asp Ser 

335 
Lys Thr Ala 
350 

Phe Thr Ser 

Cys Val Glu 

Cys Lys Thr 
400 

Ser Arg Trp 
415 



